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Abstract 



Quantum information theory is a multidisciplinary field wfiose objective is to under- 
stand what happens when information is stored in the state of a quantum system. 
Quantum mechanics provides us with a new resource, called quantum entanglement, 
which can be exploited to achieve novel tasks such as teleportation and superdense 
coding. Current technologies allow the transmission of entangled photon pairs across 
distances up to roughly 100 kilometers. For longer distances, noise arising from var- 
ious sources degrade the transmission of entanglement to the point that it becomes 
impossible to use the entanglement as a resource for future tasks. One strategy for 
dealing with this difficulty is to employ quantum repeaters, stations intermediate 
between the sender and receiver that can participate in the process of entanglement 
distillation, thereby improving on what the sender and receiver could do on their own. 

Motivated by the problem of designing quantum repeaters, we study entangle- 
ment distillation between two parties, Alice and Bob, starting from a mixed state and 
with the help of repeater stations. We extend the notion of entanglement of assis- 
tance to arbitrary tripartite states and exhibit a protocol, based on a random coding 
strategy, for extracting pure entanglement. We use these results to find achievable 
rates for the more general scenario, where many spatially separated repeaters help 
two recipients distill entanglement. 

We also study multiparty quantum communication protocols in a more general 
context. We give a new protocol for the task of multiparty state merging. The previ- 
ous multiparty state merging protocol required the use of time-sharing, an impossible 
strategy when a single copy of the input state is available to the parties. Our proto- 
col does not require time-sharing for distributed compression of two senders. In the 
one-shot regime, we can achieve multiparty state merging with entanglement costs 
not restricted to corner points of the entanglement cost region. Our analysis of the 
entanglement cost is performed using (smooth) min- and max-entropies. We illustrate 
the benefits of our approach by looking at different examples. 
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Resume 



L'informatique quantique a pour objectif de comprendre les proprietes de I'information 
lorsque celle-ci est representee par I'etat d'un systeme quantique. La mecanique 
quantique nous fournit une nouvelle ressource, I'intrication quantique, qui pent etre 
exploitee pour effectuer une teleportation quantique ou un codage superdense. Les 
technologies actuelles permettent la transmission de paires de photons intriques au 
moyen d'une fibre optique sur des distances maximales d'environ 100 kilometres. Au- 
dela de cette distance, les effets d'absorption et de dispersion degradent la qualite 
de I'intrication. Une strategic pour contrer ces difficultes consiste en I'utilisation de 
repeteurs quantiques: des stations intermediaires entre I'emetteur et le recepteur, qui 
peuvent etre utilisees durant le processus de distillation d'intrication, depassant ainsi 
ce que I'emetteur et le recepteur peuvent accomplir par eux-memes. 

Motives par le probleme precedent, nous etudions la distillation d'intrication 
entre deux parties a partir d'un etat mixte a I'aide de repeteurs quantiques. Nous 
etendons la notion d'intrication assistee aux etats tripartites arbitraires et presentons 
un protocole fonde sur une strategic de codage aleatoire. Nous utilisons ces resultats 
pour trouver des taux de distillation realisable dans le scenario le plus general, oil les 
deux parties ont recours a de nombreux repeteurs durant la distillation d'intrication. 

En etroite liaison avec la distillation d'intrication, nous etudions egalement les 
protocoles de communication quantique multipartite. Nous etablissons un nouveau 
protocole pour effectuer un transfert d'etat multipartite. Une caracteristique de 
notre protocole est sa capacite d'atteindre des taux qui ne correspondent pas a des 
points extremes de la region realisable sans I'utilisation d'une strategic de temps- 
partage. Nous effectuons une analyse du coiit d'intrication en utilisant les mesures 
d'entropie minimale et maximale et illustrons les avantages de notre approche a I'aide 
de differents exemples. Finalement, nous proposons une variante de notre protocole, 
oil deux recepteurs et plusieurs emetteurs partagent un etat mixte. Notre protocole, 
qui effectue un transfert partage, est applique au probleme de distillation assistee. 
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CHAPTER 1 



Introduction 

1.1 Motivation 

Information is a general concept which has many meanings, but is mostly under- 
stood as knowledge communicated between two entities. The science of information 
has origins dating back to the 19th century, with the works of Andrei Markov on 
probability theory and Ludwig Boltzmann on statistical mechanics. The founder of 
the theory is usually identified as Claude E. Shannon, who formalized the notion of 
information through the concepts of entropy and mutual information. These mea- 
sures characterize the limiting behavior of several operational quantities, such as the 
minimum compression length of a message or the capacity of transmitting information 
through a noisy channel. 

Quantum information theory is a multidisciplinary field whose objective is to un- 
derstand what happens when information is stored in the state of a quantum system. 
Quantum mechanics provides us with a new resource, called quantum entanglement, 
best explained from the words of Erwin Schrodinger, who coined the term in his 1935 
seminal paper "Discussion of probability relations between separated systems" [3]: 
When two systems, of which we know the states by their respective representatives, 
enter into temporary physical interaction due to known forces between them, and when 
after a time of mutual influence the systems separate again, then they can no longer 
be described in the same way as before, viz. by endowing each of them with a repre- 
sentative of its own. I would not call that one but rather the characteristic trait of 
quantum mechanics, the one that enforces its entire departure from classical lines of 
thought. By the interaction the two representatives [the quantum states] have become 
entangled. 
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Figure 1.1: Two different methods for establishing entanglement between Alice and Bob. 

Entanglement can be measured, transformed, and purified. It is essential to 
performing communication tasks such as quantum teleportation |1] and superdense 
coding [5] . It is also exploited for other computational and cryptographic tasks which 
are impossible for classical systems (for instance, cheating in a coin tossing challenge 
[B] or winning a pseudo-telepathy game [7]). 

Quantum teleportation, remote state preparation [8] and device-independent 
cryptography [9l [10] are examples of tasks which work on the assumption that 
entanglement can be shared between two spatially separated parties. To establish 
entanglement, the parties could meet at a common location and generate entangled 
pairs, with each party leaving with one half of each pair, or one of the parties could 
produce entanglement at his laboratory and send one half of each pair through a 
noiseless quantum channel (see Figure II. ip to the other party. The former strategy 
is currently infeasible as most quantum memories have very short storage times and 
are not designed to be moveable (see [H] for a review of quantum memories). As for 
the latter possibility, recent experiments [121 [H] have been successful at transmission 
of polarized entangled photons, with minimal loss of fidelity, over a distance of 144 
kilometers in free-space. (The maximum distance is roughly 100 kilometers for trans- 
mission through a fiber.) If Alice and Bob are located further away than this distance, 
absorption and dispersion effects will eventually degrade entanglement fidelity to the 
point of making long-range entanglement-based communication impossible. 

One strategy for dealing with this difficulty is to employ quantum repeaters, 
stations intermediate between the sender and receiver that can participate in the pro- 
cess of entanglement distillation, thereby improving on what the sender and receiver 
could do on their own [131 [T5l UHl [IT]. By introducing such stations between different 
laboratories, and possibly interconnecting a subset of them via fiber optics, we can 
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construct a quantum network (Figure [L2|l . 

Each node of the network represents local physical systems which hold quantum 
information, stored in quantum memories. The information stored at the node can 
then be processed locally by using optical beam splitters [18] and planar lightwave 
circuit technologies [19], among other technologies. Entanglement between neighbor- 
ing nodes can be established by locally preparing a state at one node and distributing 
part of it to the neighboring node using the physical medium connecting the two 
nodes. One of the main tasks then becomes the design of protocols that use the 
entanglement between the neighboring nodes to establish pure entanglement between 
the non adjacent nodes. 




Figure 1.2: A hypothetical quantum network connecting various university quantum lab- 
oratories. Repeater stations are represented by black dots. 

Let's consider the simplest non-trivial network, which was studied previously in 
[20] . and consists of two laboratories separated by a repeater station (see Figure [L3|) . 
At one endpoint of the network, Alice prepares an entangled system in the state 
1^)^*^^ = ^Al|00)^'^l + a/A^|11)^'^i, and sends the Ci part to the repeater station 
using the (noiseless) quantum channel connecting them. Without loss of generality, 
we can assume that Ai > A2. The repeater prepares an entangled system in the 
same state \'ip)^'^^'^ and transmits the C2 part to Bob. To establish entanglement 
between the laboratories, the repeater station performs a projective measurement on 
the composite system C1C2 with projectors corresponding to each of the four Bell 
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ACi 



l,C2B 





^/3^(Ai|00)±A2|ll)) 
^ •VVWWVVV^A/X/V'VVVi^^l "Q* 

FAIL ]Bf •Vv^/w^/w^yv^AA/^ * 

1*-) 

Figure 1.3: A one-dimensional chain with one repeater node. To perform entanglement 
swapping, the repeater node performs a Bell measurement. Depending on the 
outcome, Bob will follow with either a decoding operation (i.e a unitary) to 
get back a singlet, or a generalized measurement to produce a singlet with 
optimal probability. 



states: 



-Poo — 


1$, 




C1C2 


Poi = 




-)(^+ 


|C'iC2 


Pio = 


|$_ 




C1C2 


Pll = 


1^- 




|ClC2 



If the Bell measurement yields outcome 01 or 11, both occurring with equal probability 
A1A2, then AUce and Bob share the state -i^dOl)"^^ ± |10)^^). For the outcome 01, 
they recover the singlet state ^(|01)^^ - |10)^^) from the state ^(|01)^^ + |10)^^) 
if Bob applies the operator Z on system, where 



|0)(0|^-|1)(1 



is a Pauli operator. For measurement outcomes 00 and 10, obtained with equal proba- 



bilities ^li^ 



the reduced states on Alice's and Bob's systems are 



(Ai|00)^^± 



A2|ll)'^'^). These states are not maximally entangled. ( See Chapter 2 for a precise 
definition.) To obtain a singlet state with optimal probability ^|^^2 , Bob performs 
the following generalized measurement and communicates the outcome to Alice: 
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(1.1) 



If outcome is obtained, Alice and Bob recover a singlet state by applying appropriate 
Pauli operators on Bob's share. Otherwise, a failure is declared. Thus, the singlet 
conversion probability for this entanglement swapping strategy is equal to 2(AiA2 + 
A|) = 2A2. Remarkably, as was noted in [20], this corresponds to the optimal singlet 
conversion probability (SCP) for the state = ^/Ai|00) + V^lH) (to see this, just 
replace Ai and A2 in eq. (11. ip by -^/Ai and V^). This shows that the entanglement 
swapping strategy maximizes singlet conversion probability between Alice and Bob, 
which, in a one-dimensional chain with identical pure states between repeater 
stations, can never exceed the SCP of 

Unfortunately, the previous strategy cannot be extended to one dimensional 
chains with many repeater stations separating Alice and Bob's laboratories. In fact, 
as was shown in [20], no measurement strategy can keep the SCP between Alice and 
Bob from decreasing exponentially with the number of repeaters, making them useless 
for establishing entanglement over long distances. 

One way to deal with this problem is to introduce redundancy in the network 
[T^ . By preparing and distributing many copies of the state {ip) = ^/Ai|00) + H) 
across the chain, the repeater stations will be able to help Alice and Bob in producing 
singlets. The redundancy introduced in the network allows the stations to perform 
joint measurements on their shares, concentrating the entanglement found in each 
copy of into a small number of highly entangled particles. For one-dimensional 
chains, the rate at which entanglement can be established between the two endpoints 
will approach the entropy of entanglement S{A)^, no matter the number of repeaters 
introduced between the endpoints. The more copies of the state are prepared 
and distributed between the nodes, the more transparent the repeaters will become, 
allowing us to view the entire chain as a noiseless channel for Alice and Bob. 

This is an ideal situation, one unlikely to occur in real experiments, as only a 
finite number of copies of the state will be prepared and the preparation and 
distribution of copies of this state across the network will be imperfect. It is also 
reasonable to assume that the storage of many qubits at a repeater station, or at one 
of the laboratories, will be more prone to errors over time than the storage of a single 
qubit. Hence, the global state of a quantum network will most likely be mixed. For 
such mixed state networks, we can ask the question: how much entanglement can 
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we establish between Alice and Bob by performing LOCC operations on the systems 
part of the network ? 

In the following chapters, following a brief review of the relevant concepts in 
information theory, we consider several variations of the previous question and look 
at closely related problems. Although we do not solve the assisted distillation problem 
completely, we give new results for a less restricted form of the problem, compared to 
what was considered before in the works of DiVincenzo et al. and others [211 ISSl [231 
[211 [25], and rediscover known formulas for assisted distillation, established by Smolin 
et al. and Horodecki et al. in [2S1[21], by devising new protocols. In the remainder 
of this chapter, we give a brief summary of each of the following chapters, and then 
state the contributions found in this thesis. 

1.2 Summary 

The thesis consists of six chapters and one appendix. 

Chapter 2: Preliminaries 

This chapter is divided into three parts. First, we review relevant concepts in 
linear algebra. From this, we formulate the basic postulates of quantum mechanics 
in the language of linear algebra and discuss the density operator formalism. For 
our applications of quantum mechanics, this mathematical approach is more useful 
than standard formulations in terms of wave functions (Schrodinger picture) or time- 
dependent operators (Heisenberg picture). We then introduce the basics of quantum 
information theory, its formalism, and important results we will use in the following 
chapters. Finally, we conclude this chapter by reviewing three entanglement distilla- 
tion protocols. The first two protocols discussed are examples of "exact" approaches 
to entanglement distillation: assuming the protocols can be implemented without 
introducing errors, they yield a number of perfect Einstein-Podolsky-Rosen (EPR) 
entangled pairs with high probability. The Schmidt method describes a procedure, 
via projective measurements, for extracting EPR pairs. The hashing method, on the 
other hand, hashes an unknown sequence of Bell pairs until an exact subsequence 
is found (with high probability). The last protocol involves a different paradigm, 
prevalent in information theory: the use of random coding for showing the existence 
of a family of protocols producing states arbitrarily close to a product of EPR pairs 
at near optimal rates. We discuss this protocol in an informal manner, as this ap- 
proach will be studied in more detail subsequently and is central to the various tasks 
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analyzed in this thesis. 

Chapter 3: Multiparty state transfer 

This chapter has three parts. We begin by introducing the information-processing 
task of transferring a system from one location to another. Previous work by Abey- 
esinghe et al. [26] considered the problem from a "fully quantum" perspective: a single 
sender must use a minimal amount of quantum communication to transfer his entire 
system to the receiver. Existence of protocols achieving optimal rates was proven in 
this setting. Work by Horodecki et al. [2ll |25], who gave the first formulation of 
this problem, analyzed the task by substituting quantum communication with pre- 
shared entanglement and classical communication. Optimal rates were shown to be 
achievable by using a random measurement strategy. The problem was also extended 
to the multiple senders, single receiver setting, also known as distributed compression. 
In this chapter, we analyze new protocols for the task of multiparty state merging (m 
senders, one receiver) and split-transfer (m senders, two receivers) both in the one- 
shot regime and in the asymptotic setting. We also apply our split-transfer protocol 
to recover the formula in |2l], provided a certain conjecture holds, for the optimal 
assisted distillable rate when m helpers and two recipients (Alice and Bob) share a 
pure state. 

Chapter 4: Entanglement cost of multiparty state transfer 

This chapter has two parts. First, we reformulate the one-shot results of Chapter 
3 in terms of (smooth) min-entropies and provide protocols for one-shot multiparty 
state merging and one-shot split transfer. Our work extends some of the previous 
results by Berta [27] and Dupuis et al. [28], which considered the task of one-shot 
state merging for the case of a single sender. In the last portion of this chapter, 
we compare our multiparty state merging protocols for different family of states, 
highlighting interesting differences between the two protocols. 

Chapter 5: Assisted entanglement distillation 

This chapter has four main sections. After a brief introduction, we extend the en- 
tanglement of assistance problem to the case of mixed states shared between a helper 
(Charlie) and two recipients (Alice and Bob). This problem was first studied in the 
one-shot regime by DiVincenzo et al. [21], and a formula was found in the asymptotic 
regime by Smolin et al. [23] , which was generalized to an arbitrary number of parties 
by Horodecki et al. in [21]. We show an equivalence between the operational notion 
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of assisted entanglement and a one-shot quantity maximizing the average distillable 
entanglement over all POVMs performed by the helper. We proceed with an asymp- 
totic analysis of the mixed-state assisted distillation problem, deriving a bound on 
the achievable assisted distillable rate, which surpasses the hashing inequality in cer- 
tain cases. We generalize this analysis to the multiparty scenario and compare our 
approach with a hierarchical distillation strategy. 

Chapter 6: Conclusion 

This chapter summarizes the results established in the previous chapters. We 
also discuss some of the open problems remaining to be solved and propose different 
lines of research related to the subjects touched upon in this thesis. 

Appendix A: Various technical results 

The appendix contains proofs of various lemmas and propositions used in the 
previous chapters. We give more details regarding this part of the thesis in the 
contribution section. 

1.3 Contributions 

Chapter 2: Preliminaries 

This chapter does not contain any original material. An effort was made, how- 
ever, to present the introductory material with enough precision and substance that 
a reader with limited background in quantum information theory may grasp the es- 
sential ideas found in the following chapters. 

Chapter 3: Multiparty state transfer 
I. Removing time-sharing 

The distributed compression protocol of [21], although very intuitive and easy 
to understand by building upon the optimal rates achievable by the state merging 
primitive, must use a time-sharing argument to demonstrate achievability for rates 
which are not corner points. Our first contribution of this chapter is to give a proto- 
col for achieving multiparty state merging without requiring a time-sharing strategy. 
More specifically, we show that distributed compression for the case of two senders 
is achievable without the use of time-sharing. To show this, we adapt the ideas in 
[24] and perform a direct technical analysis of the task of multiparty state merging. 
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obtaining a bound on the decoupling error when each sender performs a random 
measurement. For the more general case of m senders, there is a technical obstacle 
to proving that time-sharing is not required for multiparty state merging. The dif- 
ficulty is more of a general quantum Shannon theory question than a problem with 
the analysis of multiparty state merging. We make a conjecture, which we call the 
multiparty typicality conjecture, and prove it is true for the case of a mixed state 
^CiC2 jj^ appendix A. 

II. Freedom in the distribution of catalytic entanglement 

A nice feature of our approach is to allow more freedom regarding the disposi- 
tion of the catalytic entanglement, sometimes needed when performing the task of 
multiparty state merging. We give a simple example to illustrate the benefits of our 
protocol over the distributed compression protocol of [21], which restricts catalytic 
entanglement to be distributed in a very specific way. This is the second contribution 
of our chapter. If time-sharing is not required for performing distributed compression 
for the case of three senders, we show that for certain states our protocol needs no 
catalytic entanglement, in contrast to the distributed compression protocol of [21], 
which needs catalytic entanglement even if some of the entanglement rates for such 
states are negative. 

III. Split-transfer 

The last part of this chapter considers the problem of state transfer for multiple 
senders and two receivers. Here, the senders are split into a group T and its comple- 
ment T. The objective is to redistribute the global state to the two receivers. More 
precisely, we must transfer the system T to one receiver while sending T to the other 
receiver. To my knowledge, this problem has not been studied before. Two indepen- 
dent applications of the multiparty merging protocol will achieve a split-transfer with 
optimal rates. In the spirit of the previous sections of this chapter, we consider this 
problem directly by customizing our multiparty merging protocol for this task. 

IV. Answering the min-cut conjecture 

Our last contribution in this chapter is an answer to a conjecture posed by 
Horodecki et al. in [2l] in the context of assisted distillation. The optimal multi- 
partite entanglement of assistance rate was found to be equal to the minimum-cut 
bipartite entanglement min7-5'(AT), where the minimization is over all possible cuts 
T of the helpers. The proof in [2l| is recursive: they show that, with high probability. 
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the min-cut entanglement is preserved after one helper has finished his random mea- 
surement and apply this reasoning recursively for all other helpers. The conjecture 
asks if this recursive argument can be removed. More precisely, if a strategy where 
all the helpers performed their random measurements all at once will yield a state 
which preserves, with high probability, the minimum cut entanglement of the state. 
We show that this is true for almost all cases provided the multiparty typicality con- 
jecture holds. Under this assumption, we show how to redistribute (many copies of) 
the original state using our split-transfer protocol in such a way that it preserves the 
min-cut entanglement. The receivers (Alice and Bob) can follow with a distillation 
protocol, yielding a rate of EPR pairs corresponding to the min-cut entanglement of 
the original state. 

Chapter 4: Entanglement cost of multiparty state transfer 

I. Entanglement cost region of multiparty merging 

Our first contribution of this chapter is to reformulate the upper bound derived 
in Chapter 3 for the decoupling error as a function of various min-entropy quantities. 
With this result in hand, we give a partial characterization in terms of min-entropies 
of the entanglement cost region achievable for multiparty state merging when a single 
copy of the state is available. For any point of this region, we show the existence 
of multiparty merging protocols of the kind described in the previous chapter, where 
all the senders measure their systems simultaneously and the decoder implemented 
by the receiver is not restricted to recovering the systems one at a time. We derive 
analogous results for the task of split-transfer by applying the same proof technique. 

II. Smooth min-entropy characterization 

Using the approach of Horodecki et al. [2l] for achieving a distributed compres- 
sion of a multipartite state ^^c'lCa-.-Cm/?^ analyze the entanglement cost associated 
with multiparty merging when a single-shot state merging protocol is applied itera- 
tively, according to some ordering vr : {1,2,..., m} — j- {1,2,..., m} on the senders. 
By building upon the results of Berta [27j and Dupuis et al. [28] , we show the exis- 
tence of multiparty merging protocols with arbitrarily small error and entanglement 
cost characterized by the smooth min-entropies of the reduced states tp ' t-^co, where 
-R^-i(j) is the relative reference for the sender Cj with respect to an ordering vr of the 
senders. This is the second contribution of this chapter. 
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III. Examples of one-shot distributed compression 

The remainder of this chapter is devoted to examples. We compare the protocols 
described in this chapter for the task of distribution compression. We give three 
examples, two of them being closely related to the second distributed compression 
example of Chapter 3, and look at the entanglement costs required for merging the 
states. We find, once again, that our direct approach to the task of multiparty merging 
yields better results: our protocol outperforms an application of many single-shot 
two-party state merging protocols by allowing some of the senders to transfer their 
systems for free. 

Chapter 5: Assisted entanglement distillation 

I. Generalizing the entanglement of assistance 

The first contribution of this chapter is to extend the one-shot entanglement of 
assistance quantity, first defined in [21], to handle mixed states ip^^^ shared between 
two recipients (Alice and Bob) and a helper Charlie. This quantity reduces to the 
original entanglement of assistance when the state is pure. We give an operational 
definition of assisted distillation for mixed states ip^^^ and show an equivalence 
between the optimal distillable rate and the regularization of the entanglement of 
assistance quantity. This equivalence is used in the following section for proving 
achievable rates on the optimal assisted distillable rate when the parties share many 
copies of a mixed state ip^^^ . We give two upper bounds to the entanglement of 
assistance for mixed states, and provide an example which saturates one of the upper 
bounds. 

II. Achievable rates for assisted distillation 

Using the equivalence between the optimal distillable rate and the regulariza- 
tion of the entanglement of assistance quantity, we give a lower bound on the op- 
timal rate for assisted distillation of mixed states for the case of one helper. We 
prove the existence of a measurement for the helper Charlie which will preserve, 
with arbitrarily high probability, the minimum cut coherent information L^ip) := 
{I{AC)B)^,I{A)BC)^} of the input state. This is the second contribution of this 
chapter. Using this measurement in a double blocking strategy, Alice and Bob can 
recover singlets at the rate L{'il)) by applying standard distillation protocols as in 
[29] . If Charlie preprocesses his share of the state to optimize the minimum cut co- 
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herent information, higher rates can potentially be achieved. If the state if)^^'^ does 
not saturate strong subadditivity, and the coherent information I{C)AB)^ is posi- 
tive, the achievable rate is higher than what the hashing inequality guarantees when 
performing a one-way distillation protocol. 

III. Optimality 

Achievability of the min-cut coherent information has a surprising consequence: 
we can achieve a rate close to what could be obtained if Charlie were allowed to send 
his system to either Alice or Bob, whichever minimizes the minimum cut coherent 
information. We give a specific example where Charlie is not capable of transferring 
his system to Alice for free, but the assisted rates achievable are nonetheless close to 
I{AC)B)^. When L{'il)) is the coherent information I{A)BC)^, however, Charlie can 
merge his system to Bob. For such a case, we can achieve an optimal rate for assisted 
distillation by applying a merging protocol before engaging in a distillation protocol. 
This is the third contribution of this chapter. 

IV. Fault-tolerance 

We compare our assisted distillation protocol to a hierarchical strategy consisting 
of entanglement distillation followed by entanglement swapping. The first example 
we analyze considers a one-dimensional chain where the Alice to Charlie's channel is 
noiseless but the Charlie to Bob channel is noisy. For a state in a product form, we 
find that the rate achieved by our protocol is the same as the rate obtained by using 
a hierarchical strategy. We modify our setup by introducing a CNOT error affecting 
Charlie's systems. We show that our random measurement strategy is fault-tolerant 
against such error: the assisted distillation rate remains the same, even in the absence 
of error correction by Charlie. On the other hand, the rate obtained by a hierarchical 
strategy becomes null. Thus, we identify a major weakness to using hierarchical 
strategies: it is not fault-tolerant against errors arising at Charlie's laboratory. 

V. Multipartite entanglement of assistance 

The last part of this chapter generalizes the multipartite entanglement of assis- 
tance of [23l [23] to allow an arbitrary multipartite mixed state shared between m 
helpers and two receivers. Our one-shot quantity reduces to the original multipartite 
entanglement of assistance quantity when the state is pure. We derive an upper bound 
to this quantity, and then perform an asymptotic analysis, proving the existence of 
protocols achieving a rate which is at least the minimum cut coherent information 
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I{AT)BT).ip, where T is a cut of the helpers. Our proof relies on a multiple blocking 
strategy and suggests the possibihty of a simpler protocol for achieving the minimum 
cut coherent information. This is the fifth contribution of this chapter. 

Appendix A: Various technical results 

I. A different proof of the twirling average 

We give a detailed calculation of the twirling average, a key result (see [21] and 
[26] for the original proof) used in Chapter 3 for proving one of the important results 
of this thesis. Our proof does not rely on Schur's lemma, a fundamental result in rep- 
resentation theory, but instead relies on the invariance property of the Haar measure 
with respect to permutations, sign-flip operators and Hadamard transformations. 

II. Convexity of the entanglement of assistance 

We give a proof of the convexity of the entanglement of assistance for pure en- 
sembles {pi, ipf^^}. This result is used in Chapter 5 for proving an upper bound to 
the one-shot entanglement of assistance. 

III. Lower bound to the smooth max entropy 

By removing the smallest eigenvalues of a state p, without disturbing the state 
too much, we get a useful lower bound to the smooth max entropy H^^J^p). We use 
this bound in Chapter 4 for the various examples we analyze. 

IV. Multiparty typicality conjecture 

We give a proof that the multiparty typicality conjecture is true for the case 
of a mixed state Our proof relies on a well-known inequality of probability 

theory and uses a double blocking strategy for constructing a state which satisfies the 
typicality conjecture (see Section [3.2.4p . 
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Preliminaries 



2.1 Representation of physical systems 

2.1.1 Hilbert spaces and linear operators 

A set y is a vector space over a field T if given two operations, vector addition 
and scalar multiplication, it satisfies certain axioms (see table I2.ip . Examples of 
commonly used fields are the field of real numbers M, the field of complex numbers 
C, and the Galois field F2 consisting of two elements, and 1, for which addition 
and multiplication correspond to XOR and AND operations. Examples of vector 
spaces are the Euclidean n-space M", the complex vector space C", and the space of 
all functions / : X — J-" for any fixed set X. For the space C", the vectors are the 
n-tuples z = {zi, Z2, ■ ■ ■ , Zn) with Zi G C, and the addition and scalar multiplication 
operations are defined in a pointwise fashion: for vectors x = (xi, X2, . . . , x„) and 
y = {yi, y2, ■ ■ ■ , y-n) in C", and scalars a G C, we have 

X + y = {xi+ yi,X2 + y2,- ■ ■ ,Xn + Vn) and ax = {axi,ax2, ■ ■ ■ ,axn). 

A set 5 = {t"!, ^2, • • • , t'n} of vectors in V is called a basis of the vector space V if it 
is a linearly independent set which generates the whole space V . That is, no vector 
in B can be written as a linear combination of finitely many other vectors in B, and 
the set of all linear combinations of the vectors in B correspond to the whole space 
V . A vector space with basis {^1,^2, • • • , v^} is said to have dimension dy = n. 

To add notions of length and distance to a vector space, we introduce a third 
operation, called the inner product {u,v) : VxV J^. Here, the field is usually taken 
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closure 


If u and V are in V, then u + v is in V. If a G C 




and u E V, then au G V. 


associativity 


u + {v + w) = {u + v)+wfoT all u, V, and w in V. 


compatibility 


a{hv) = {ah)v for all G and all a, 6 G C. 


commutativity 


u + V = V + u ioi all n, G V . 


zero element 


An element in ^ exists such that t'+O = v = 




for all V eV . 


inverse 


For each f G an element — f exists in V such 




that — w + f = = w + (— w). 


distributivity 


a{y + w) = av + aw and (a + b)v = av + bv for all 




v,w E V and a, 6 G C. 


identity 


1?; = f for all v E V. 



Table 2.1: Axioms for a complex vector space. 



conjugate symmetry {u, v) = {v, u) for all u,v eV . 

linearity {v + w,u) = {v,u) + {w,u) and {rv,w) = r{v,w) 

for all u, V, w in V and r G C. 
positive-definiteness (w?^) > for all v E V with equality iff f = 0. 

Table 2.2: Axioms for the inner product when = C 

to be either R or C. An inner product must satisfy the three properties described 
in table 12.21 A vector space V with an inner product (, ) is called an inner product 
space. We can define an inner product for the space C" as follows: 

{x, y) := Xiyi + ^22/2 + • • • + XnVn- 

We have (x, x) = + |x2p + . . . + > 0, and the other two axioms can be 
verified just as easily. For an inner product space V, we assign a "length" to a vector 
V via the norm 

\\v\\ := {v,v). 

A vector space V on which a norm is defined is called a normed vector space. For 
two vectors x,y oi a normed space V, we can add a notion of distance between two 
vectors x and y by using the norm: 

d{x,y) := \\x - 

Symmetry and positivity of d[x^ y) follow easily from the above definitions. The 
triangle inequality d{x, z) < d{x, y) + diy, z) can be recovered using the Cauchy- 
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Schwarz inequality: 

\{x,y)\ < 

A space V for which a distance function d{x, y) is defined is called a metric space. 
A metric space is complete if and only if every sequence Xi, X2, X3, . . . of vectors in 
V for which d{xn,Xm) 0, as both n and m independently tends toward infinity, 
converges in V. That is, for every such sequence xi,X2,X3, . . . there exists a y E V 
such that d{xn, ?/) — as n — )■ 00. 

A Hilbert space H is a real or complex inner product space which is also a com- 
plete metric space with respect to the distance function induced by the inner product. 
For finite dimensional Hilbert spaces, the completeness criterion is automatically met 
and, thus, any real or complex inner product space is also a Hilbert space. As we will 
see shortly, Hilbert spaces arise in quantum mechanics to model the state space of a 
physical system. The tasks analyzed in this thesis involve quantum systems which can 
be adequately described using finite dimensional complex Hilbert spaces. Henceforth, 
we assume the Hilbert spaces to be of finite dimension. Vectors for a complex Hilbert 
space Ha associated with a physical system A are written using the Dirac notation, 
also known as bra-ket notation, in the form • • • These vectors are called 

kets, and for every ket 1?/;)^ of the Hilbert space "Ha, henceforth written simply as A, 
there is an associated linear functional : A ^ C called a bra: 

(^|(|0)):=(^,0), 

where the right hand side is the inner product of the two vectors \ip)^ and The 
motivation for the bra-ket notation comes from this last definition, where we see that 
by removing parentheses around the vector \(f))^ and fusing the bars together on the 
left hand side of the definition, we obtain a complex number {ip\(f)) called a bra-ket 
or bracket. 

A basis for the space is given by {(0, 1), (1, 0)}, which can be rewritten in 
braket notation as {|0),|1)}. This is known as the computational basis for the 
space C^. For the general space C", the computational basis will be written as 
|2), |3), . . . , In)}. Any vector {tp) G C" can then be written as 

n 

1=1 
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Given two Hilbert spaces A and B, we can construct a larger Hilbert space of 
dimension dAds by taking the tensor product A® B. Given two orthonormal bases 
{\vi)^}i^i and {\wj)^}j^i of A and B (i.e {vi\vj) = and {wi\wj) = for any i ^ j ), 
the tensor product A®B is the space generated by the basis elements {\vi)^®\wj)^}. 
How tensor products \vi)'^ ® ^'^^ formed for two vectors \vi)'^ and \wj)^ is a bit 

more technical, and we refer to [30] for more information on this subject. The tensor 
product for complex vector spaces satisfies the following three properties: 

1. For any z G C and arbitrary vectors \v)'^ of A and \w)^ of i?, 

^(1^;)^ eg) \w)'') = {z\v)^) ® \w)'' = \v)'^ ® {z\w)''). 

2. For arbitrary vectors \vi)^ and 1^2)'^ in A and \w)^ in 5, 

3. For arbitrary vectors \v)^ in A and \wi)^ and |tf2)'^ in -B, 

As an example, for the two Hilbert spaces A := C" and B := C™", the tensor product 
of the two vectors lip)"^ = X]r=i'^«K)^ \^)^ ~ is given by 

n m 

i=i j=i 

where we have written for the tensor product ® This shorthand 

notation will often be used in the following chapters. 

Vectors of a Hilbert space A can be transformed via linear operators L : A ^ B. 
The image of L is defined as 

im L := {L\v) : \v) G A}. 

It is a subspace of B and its dimension is called the rank of L. The set of all linear 
operators L : A ^ B is denoted by C{A,B). For linear operators acting from A 
to itself, we use the shorthand notation C{A). Given any basis {vi,V2, ■ ■ ■ ,Vn} of a 
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Hilbert space A, the trace of an operator L G C{A) is defined as 



n 



Tr{L) ■.= J2{v^\A\v^). 



i=l 



Several classes of linear operators will be of interest to us. The first one is the 
set of hermitian operators acting on the Hilbert space A. Given a linear operator 
L : A ^ A, the hermitian conjugate (adjoint) of L is the unique operator such 
that for all vectors |f )'^, Iw)"^ G A, 



An operator H whose hermitian conjugate is H is known as hermitian or self-adjoint. 
In general, for two operators A and B, we have {ABy = B'^A''. By convention, we 
also define := {v\. General hermitian operators can be written elegantly via the 
spectral decomposition theorem. 

Theorem 2.1.1 (Spectral decomposition). Let H he an hermitian operator acting 
on a Hilbert space A. Then, there exists an orthonormal basis {|ei)^}f^;^ of A such 
that H is diagonal with respect to this basis: 



where all the eigenvalues of H are real numbers. 

The image of H is spanned by all the eigenvectors |ej)^ with non-zero eigenvalues. 
It is also called the support of H. 

An important subclass of hermitian operators are projection operators P : A ^ 
W. These are hermitian operators which are also idempotent: 



Given a (ii4/-dimensional subspace W oi a Hilbert space A and an orthonormal basis 
of W, the projector onto the subspace W is defined as 




H = ^ Ai|ei)(ei 



i=l 



= P and = P. 



P = Y.\€){i 



1=1 
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The orthogonal complement of P is given by Q := — P, where is the identity 
operator on A. For a vector of A, the operator often written simply 

as is the projector onto the 1-dimensional subspace spanned by the vector {tp)^. 
Projectors will be used later on to describe the process of measuring a physical system. 

Another subclass of hermitian operators that we will frequently use are the pos- 
itive semidefinite operators. An operator X on A is positive-semidefinite if for all 
vectors \v)^ G A, the inner product (t;|X|f ) is a real and non-negative number. We 
will often drop the word "semidefinite" and refer to X simply as a positive operator. 
From the spectral decomposition theorem, the eigenvalues and the trace of a positive 
operator X must be non-negative real numbers. Given hermitian operators H and 
K acting on the space A, we say that H < K ii K — H is positive. This defines a 
partial ordering on the set of hermitian operators. The class of positive semidefinite 
operators of trace one have a special importance in quantum mechanics, and we refer 
to them as density operators. We will see shortly that they capture the statistical 
behavior of a quantum system. 

One very important class of linear operators we will be concerned with are the 
unitary operators. An operator Ua in C{A) is said to be unitary if U\Ua = Ia- 
This also implies UaU\ = I^. Unitary operators preserve lengths and angles between 
vectors. For any pair of vectors \vi)^ and \v2)^ of A, we have 

||f/A|^i)^||i = |||t;i)^||i. 

{Ua\v^)^,Ua\v2)'') = {\V^)^AV2)^). 

The last line can be easily seen to hold by rewriting the inner product {Ua\vi)^, Ua\v2)^) 
in braket notation as (vi\U\U a\v2) ■ The result then follows by substituting U\Ua 
with the identity operator. Unitary operators can be used to construct new orthonor- 
mal bases: given an orthonormal basis {Iwj)'^}^^;^ of A, let \wi)^ = UaIvi)^. Then 
{\'^i)'^}i=i is an orthonormal basis of A. 

We can generalize the class of unitary operators by considering input and output 
spaces of different dimensions. An isometry F : A B ior two Hilbert spaces A and 
i? is a linear operator which satisfies 

dBiF\a),F\b)) = dAi\a),\b)), 

for any two vectors \a),\b) in A. A unitary operator Ua on a Hilbert space A is a 
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special case of an isometry where the function F is also surjective (i.e the image of 
F is A). For two Hilbert spaces A and B of different sizes, with < d-B, we can 
extend any unitary operator Ua to an isometry V : A ^ B hj identifying a subspace 
B' of dimension ds' = d^ with A and letting V^lf)"^ = 7i{U\v)^), where n : A ^ B' is 
an isomorphic map from A to B. Define the kernel (ker(F)) of F to be the subspace 
of all vectors in A which map to the zero element of B under the function F. A 
function W : A ^ B is a partial isometry if, for any two vectors |a), \b) in the 
orthogonal complement of ker(iy), we have dB(W\a),W\b)) = (i^(|a), Partial 
isometries appear in chapter 3 to model a random coding strategy in the context of 
state merging. 

Finally, given two linear operators R and S acting on the spaces A and B respec- 
tively, the Kronecker product R^ S is the matrix 



where R and S are the matrix representations of the operators R and S. 

2.1.2 Quantum mechanics 

Unlike the theory of relativity, which was the work of a single individual [31] . 
the theory of quantum mechanics as we know it today was the culmination of years 
of work from various physicists during the first half of the twentieth century. The 
failure of classical physics to explain observed phenomena such as the ultraviolet 
catastrophe and the photoelectric effect forced physicists to reconsider the nature of 
the physical world. A new set of rules was required for making accurate predictions 
on the outcome of any scientific experiment. After a relatively long process of trial 
and error, a mathematical formulation of quantum mechanics was made precise and 
found to successfully predict all scientific experiments known at the time. Since then, 
no known experiment has contradicted the predictions of quantum mechanics. Any 
physical theory based on the structure of quantum mechanics must obey the following 
four basic postulates: 

Postulate 1 Associated with any physical system A is a Hilbert space A called the 
state space. The system is completely described by its density operator ip"^, 
which acts on the state space of the system A. 





R^S : 



(2.1) 




RdAdA^ J 
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Postulate 2 The evolution of a closed quantum system A is described by a unitary 
transformation U. That is, if and are the density operators of the system 
A at times ti and t2, they are related by a unitary operator Ua which depends 
only on ti and t2'- 

Postulate 3 Quantum measurements realized on a physical system A are described 
by a set of linear operators {M^} acting on the state space of A. The probability 
of obtaining outcome m is given by 

p{m) = TiiMlM^ij^), 

where is the density operator describing the system A. After obtaining 
outcome m, the system is described by the density operator 

Tr(M^V^Mi) ■ 
The operators {M^} satisfy the completeness equation, 

J2mIM^ = I^. (2.2) 

m 

Postulate 4 The state space of a composite physical system is the tensor product 
y4i €5 . . . ® v4„ of the state spaces Ai, A2, . . . , An of the component physical 
systems. Moreover, if each system Ai is described by the density operator 
the density operator of the system A1A2 . . . An is given by ■ip^'^ ®ip^^ <S) . . .<S)ip^"- 

Other equivalent formulations of quantum mechanics exist (see, for instance, [32| [33| 
131]). In the context of quantum information theory, however, the previous formulation 
in terms of density operators will be very useful as we will often deal with composite 
systems in an unknown state. The density operator gives a complete mathematical 
description of the statistical behavior of its associated system. From the spectral 
decomposition, any density operator can be written as a convex combination of 
normalized eigenstates: 

d 

i=l 
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where Ai > for 1 < i < d. li d = 1, the system is in the pure state 
often written simply as We will often use the term "state" to refer to the density 
operator \ip) {ipl^ as opposed to the vector \ip)^ of the state space. If d > 1, the system 
is said to be in the mixed state ip^. In such a case, different ensembles {pi, I'lpi)^} 
of pure states may realize the density operator ip"^ = example, 
consider the ensembles {1/2, |0)^, 1/2, |1)^} and {1/2, if |0)^ 1/2, if|l)^}, where H 
is the Hadamard operation: 

H\0)^ :=i=(|0)^ + |l)^), 
H\l)^ :=i=(|0)^-|l)^). 

Both ensembles realize the same density operator 

7/;^ = 1/2|0)(0|^ + 1/2|1)(1|^ = /^/2 = l/2(i/|0)(0|^ift + H\1){1\^H^), 

which is called a maximally mixed state r"^ := 1^/2 of dimension d^- 

When realizing a quantum measurement on a system A, we may only be interested 
in the outcome of this measurement (for instance, to distinguish between two possible 
states ipi and ip2 of ^^e system). For a set of measurement operators {Mm}, let 

Em = MlMm Vm. 

Then, {Em} are positive operators as for any \v)^, we have {v\Em\v) > 0. According 
to Postulate 3, the probability of obtaining outcome m is given by Ti^Ml^Mmi^^)- 
Replacing M^M^ by Em, the probability p(m) is equal to Ti^Em'ip'^), and from the 
completeness equation eq. (12.21) . we have 

Em = I^- 

m 

The set {Em} is the POVM (Positive Operator Valued Measurement) associated 
with the measurement. Conversely, let {Em} be a set of positive operators acting 
on A which satisfy J2m^m = I"^- Writing Em = XliLi (^jl^^ using the spectral 
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decomposition, we can define measurement operators Mm '■= y/Em, where 

d 

\fKa ■■= ^A//ii|ei)(ei|^. 

i=l 

Then, the set \Era\ is the POVM associated with the measurement described by 
the operators {Mm}. It is important to understand that measurements described via 
POVMs will generally not allow us to known the state of the system afterwards. This 
is due to the fact that, for a given POVM {-Em}, we can choose any set of unitaries 
{t/m} and construct measurement operators = UmVEm which will describe a 
measurement with POVM {Em}- 

Suppose we have a composite system AB, whose state is described by its density 
operator ip^^ . We can prescribe a "state" to the subsystem A via the reduced density 
operator: 

:= TiBip"^^, 

where Ttb '■ A^ B ^ A is a. linear operator known as the partial trace, and is defined 
by 

TrB(ki)(x2r® := \x^){x2\^TT{\y^){y,\^), 

where \xi)^ and \x2)^ are any two vectors in the state space of A and \yi)^ and 
12/2)^ are any two vectors in the state space of B. This gives a correct description 
of the statistical behavior of the subsystem A in the sense that for any measurement 
Mm on A, the outcome probabilities p{m) computed using equal the probabilities 
q{m) computed using the density operator ip"^^ for the measurement Mm ® Ib- The 
partial trace is the unique function satisfying this property. For the tensor product 
state = 0"^ ® 0^, the reduced state is equal to TiB'ip^^ = o^. 

2.1.3 Schmidt decomposition and purifications 

For any pure state "^^^ of a composite system AB, we can always find orthonor- 
mal states Icj)"^ for the system A and orthonormal states for the system B such 
that '^^^ can be written as a superposition of the states \eifi)^^. This is the Schmidt 
decomposition theorem: 

Theorem 2.1.2 (Schmidt decomposition). Suppose \ip)^^ is a pure state of a com- 
posite system AB. Then, there exist orthonormal states {\ei)^}f^i for the system A 
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and orthonormal states {\fi)^}f=i for the system B such that 

i=l 

where the Aj are non-negative real numbers satisfying Af = 1 known as Schmidt 
coefficients. The number of non-zero values \i is called the Schmidt rank for the state 

As a consequence of the Schmidt decomposition, the reduced density operators 
ip"^ and Tp^ of the state tp"^^ share the same spectrum: 

i=l 
i=l 

For a proof of the Schmidt decomposition theorem, see Nielsen and Chuang [35]. 

For a mixed state ijj^^ it is always possible to introduce another system i?, called 
a purification system, and find a pure state \ip)^^ of the composite system AR such 
that ijj^ = Tifiip^^. To see this, write as ^1^=1 using the spectral 

decomposition theorem. Let R have dimension dji := d^, with orthonormal basis 
states \ei)^, and define the pure state 

dA 
i=l 

Then, we have 

Trnmii^D = 5^ v^|e.)(e,|^Tr(|e,)(e,|«) 
= ^ A/A~X~|ei)(ej|^(5ij 

ij 
dA 

1=1 
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where Sij is the Kronecker symbol. It is always possible to purify a state in more 
than one way. However, for any two purifications and 10)"^^^ of a state ip"^, 

with purification systems -Ri and R2, there exists a partial isometry U : Ri R2 
taking the Ri system to the R2 system such that 

^ ^jA^^Ri)|^^A/?i_ ^2.3) 

2.1.4 Separable states and maximally entangled states 

A state ip^^ of a composite system AB is called separable if it can be written as 
a convex combination of tensor products of density operators {pf} of the subsystem 
A and density operators {erf} of the subsystem B: 

i 

where ^jPi = 1. Since the partial trace is a linear operator, we get from the previous 
equation that = YliiPiPf '^^ — YliiPi^i ■ ^ state ip^^ which is not separable 
is called entangled. For a pure state I^A)^^, the previous definition of separability 
implies that \ip)'^^ is separable if and only if there exist vectors \x)^ and \y)^ of A 
and B such that = \x)^\y)^. Alternatively, a pure state l^p)"^^ is a product 

state if and only if its Schmidt rank is 1. Therefore, any pure entangled state for a 
composite system AB whose subsystems A and B both have dimensions two must 
have a Schmidt decomposition with two non-zero values Ai and A2. Examples of pure 
entangled states for such a composite system are the Bell states: 
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where the last state is known as the singlet state or an EPR pair [36]. These states 
form a basis of the tensor product space A® B. In general, for two systems A and B 
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with (Ia ^ ds, a. maximally entangled state of dimension dA is defined as: 




where {\'m)^}'^^^ is an orthonormal basis for A and {\m)^}'f^^ is a family of orthonor- 
mal vectors on B. The amount of bipartite entanglement in a state ip^^ is measured 
in ebits, with Bell states having an amount of entanglement equal to one ebit. A 
maximally entangled state of dimension dA is said to have \og{dA) ebits. 

Other examples of entangled states are obtained via mixtures of Bell states. The 
family of Werner states [37] is defined as 

Wf = + + |<|._)($_| + |^+)(^+|), 

where < F < 1. The value of F for the Werner state Wp is equal to (\i/_|VFir|\E'_), 
which is the entanglement fidelity of Wp relative to the singlet state. For an arbitrary 
bipartite state ip"^^, the entanglement fidelity of relative to the singlet state is 
defined as: 

F^^P^^, |^-)(^-|) := (^-I^A^^I^-). 

The Werner state Wp is separable for F < 1/2, and entangled for F > 1/2. Werner 
states facilitate the analysis (see, for instance, Bennett et al. [38]) and construction 
of entanglement distillation protocols: the process of converting a large number of 
copies of an entangled state ip"^^ to a smaller number of highly entangled states such 
as EPR pairs. 

2.2 Quantum information 

2.2.1 von Neumann entropy 

The Shannon entropy H{X) [39] of a random variable X yielding outcome x with 
probability px is defined as: 

H{px) = H{X) ■= - ^px \ogpx, 

X 

where the logarithm is taken base 2. This quantity is always non-negative, with 
H{X) = if and only if the random variable X yields a definite outcome x with 
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Px = 1- It takes a maximal value of \ogd for a random variable X generating d 
possible outcomes with equal probabilities. 

For data communication, the Shannon entropy is the theoretical limit at which 
information produced by a source can be compressed, transmitted and recovered in 
a lossless way. Its basic unit is the bit. A binary random variable X taking values 
and 1 with probability one-half is said to have 1 bit of entropy. 

For a density operator t/;^, with spectral decomposition = Aj|ej)(ej|^, 
we define its von Neumann entropy S{A)^ as 

5(A)^ = -Tr(^^logz^^), 

where logip^ = log(Aj)|ej)(ej|'^ and the logarithm is taken base two (we define 
Olog(O) := 0). The spectral decomposition of ip^ allows us to relate the von Neumann 
entropy to the Shannon entropy: 

d 

5(A)^ = -5^A,logA, = i/(X), 

i=l 

where X is a random variable yielding outcome i with probability A,. The von 
Neumann entropy is non-negative, and is zero if and only if the state is pure. For the 
maximally mixed state I^/d, we have S{A)jA/d = logd. 

The original motivation for the von Neumann entropy did not come from an 
information-theoretical context, unlike the Shannon entropy. It was actually an at- 
tempt to extend a thermodynamical concept, the Gibbs entropy (see [IQ] for an in- 
troduction to thermodynamics), to the quantum setting. The extension of Shannon's 
work to the quantum regime happened several years later, beginning with the work 
of Ohya and Petz [H]. The quantum version of Shannon's noiseless coding theorem 
was obtained by Benjamin Schumacher |12], who coined the term qubit, the basic 
unit of quantum information, and characterized the von Neumann entropy as the 
optimal rate at which quantum information produced by a source can be compressed, 
transmitted and recovered by a receiver in a lossless way. 

A qubit is a 2-dimensional quantum system A. A composite system in any of 
the Bell states constitute two qubits. The reduced state of either subsystem is in the 
maximally mixed state, with entropy S{A)ja/2 = S{B)jb/2 = 1- If state ip^^ of a 
composite system AB is pure, we have S{A)^ = S{B)^. Given a state = YliiPi'^t 
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written as a convex combination of other states ipf, the von Neumann entropy is a 
concave function of its inputs ipf: 

i 

where equahty holds iff all the states ipf, for which pi > 0, are identical. The von 
Neumann entropy is invariant under unitary transformations on the state ip^: 

S{A)^ = S{A)u^ut. 

For a state ip^^ of the form tjj^^ = ^^iPili) ® ^f, we have 

S{XA)^ = H{X) + J2p,S{A)^a, 

i 

where H(X) = — ^jPilogpj. We refer to the state ip-^"^ as a classical-quantum 
(cq-) state with classical system X. 

For a tensor product state = ^ cr^ , we have S{AB)^ = S{A)p + S{B)^. 
The von Neumann entropy of a joint state ip^^ satisfies the following inequality, 
known as subadditivity (see [35] for a proof): 

SiAB)^ < SiA)^ + SiB)^, (2.4) 

where S{A)^ and S{B)^ are the von Neumann entropies for the corresponding reduced 
density operators and ip^ . Equality holds if and only if the state ip"^^ can be 
written in the product form (8> ip^- Given a state ip^^ of a composite system AB, 
the conditional von Neumann entropy S{A\B)^ is defined as 

S{A\B)^ = S{AB)^-S{B)^. 

Unlike the Shannon entropy H{X\Y) = H{XY) — H{Y), the conditional von Neu- 
mann entropy can be negative. As an example, consider the singlet state 
We have S{AB)^ = since the state is pure, and S{B)^ = 1 since its reduced density 
operator is the maximally mixed state /_b/2. For a tripartite system A ^ B ^ C in 
the state ip^^'-^ , the conditional von Neumann entropy S{A\BC) is bounded above 
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by S{A\By. 

SiA\BC)^ < SiA\B)^. 

This is known as the strong subadditivity property of the von Neumann entropy. First 
conjectured by Lanford and Robinson a proof of this inequahty was obtained 
by Lieb and Ruskai in [H] . A simple operational proof of this inequality also follows 
from quantum state merging [211 [25] . 

2.2.2 Quantum operations, instruments, LOCC 

A quantum channel is a medium for carrying quantum information from one loca- 
tion to another. To motivate its mathematical description, let's consider a collection 
of linearly polarized photons, each prepared in some polarization state ipf. As part 
of a protocol implemented by two spatially separated parties (for instance, the BB84 
cryptographic protocol of Bennett and Brassard [6]), we need to send the photons 
through a fiber optic channel to another laboratory B. Fiber optics, unfortunately, 
are not a perfect medium for information transmission. Photons sent through a fiber 
are subject to attenuation, also known as transmission loss, and dispersion effects. 
These phenomena will have highly undesirable consequences on the polarization state 
of the photons, and may prevent the detection of photons by the receiver's apparatus. 
We can model the process of photons passing through a fiber as two systems which in- 
teract for some period of time. If we assume the photons and the fiber form a closed 
system, their interaction can be described by a unitary operator U : AE BE, 
where E represents the fiber optic system (also called the environment). The exact 
specification of the unitary will depend on the characteristics of the fiber. We can 
assume, prior to transmission, that the AE system is in a product state ip^ ® |0)(0|^. 
A fiber optic channel is then represented by a map M : A ^ B such that 

^/;«=Ar(^^) :=Trs([/(V;^® |0)(0|^)?7t). (2.5) 

This is known as the Stinespring form for the channel M . We can re-express the 
previous formula by introducing linear operators Ei : A B , defined as: 

E, := (z|^f/|0)^. 
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where {|^)^} is a basis of the environment system E. Putting these into eq. (I2.5p . we 
have 

i 

Alternatively, one could start from a set of linear operators Ei : A ^ B which satisfy 

Y,ElE-.<l\ (2.6) 

i 

known as the completeness relation for the operators and define the quantum 

operation 

S{i^^):=Y,E^i^^El (2.7) 

i 

That the output of this operation is a sub-normalized (i.e Trp < 1) density operator 
follows from the completeness relation. Quantum channels can be regarded as a 
quantum operation whose intent is to carry quantum information. As an example, 
consider the following channel S : A ^ B for transmitting a qubit in the state 
|V^)^ = ao|0)^ + ai|l)^: 

£:(|^)(V;|^):=(l-p)|V')(^r+p|2)(2|^ (2.8) 

where < p < 1 and |2)(2|^ is a state orthogonal to |0)^ and This is a 

simple model for photon loss. The transmitted photon is either perfectly detected 
with probability 1 — p or replaced by some "erasure" state with probability p. 

Eq. (12.71) is known as the operator-sum representation of a quantum operation. 
It allows for more general forms of quantum operations than the previous formulation 
of quantum operations in terms of interacting systems. The operators Ei are called 
Kraus operators. A quantum operation which has a non-trace-preserving output 
corresponds to a process which occurs with probability Ti{£{il)^)) (for instance, a 
specific measurement outcome). 

For a composite system AR, a quantum operation acting on the density operator 
^AR gjiQ^y leave the composite system AR in a density operator (up to some nor- 
malization) after the operation is performed. This is called the completely positive 
requirement: 

:= (/^®£)(^^^) > 
for any extra system R of arbitrary dimension. Quantum operations defined via the 



30 



Chapter 2 



operator-sum representation satisfy this property (again, see [35] for a proof of this 
fact). 

We can describe measurements as a set of non trace-preserving quantum opera- 
tions {Si}- To illustrate this, let's consider a special kind of generalized measurement, 
called a projective measurement, described by a set of orthogonal projectors {Pi} (i.e 
PiPj = SijPi) satisfying: 

^P. = J^. (2.9) 

i 

The probability of obtaining outcome i for a system A in the state is then given 
by Tr(Pj'?/;^). Alternatively, we could have described projective measurement as the 
set of quantum operations Si^ip"^) := Piip^P} . That these are valid operations follows 
from the completeness equation eq. (12.91) . We obtain the measurement outcome i 
with probability Ti[£i[ip^)) since 

Tr(£:,(^/>^)) = Tr(P,^A^P,) 
= Tr(P,7/;^). 

The normalized state after obtaining outcome i is given by Tr^g^^^l)-) • Notice that 
each quantum operation £i is described using only one Kraus operator and that the 
sum Ylii is a trace-preserving quantum operation. 

Generalized measurements can be described similarly using a set of quantum op- 
erations {£i}, with Eii^j"^) = Miip^Ml We can generalize the previous examples by 
considering quantum operations with operator-sum representations containing more 
than one Kraus operator. We call an instrument X := {Ei} [35] a set of completely pos- 
itive maps (i.e non trace-preserving quantum operation) which sums to a completely 
positive and trace preserving map. The elements of the set {Ei} are the instrument 
components. Instruments can be used in protocols when one party needs to perform 
a measurement followed by an isometry conditioned on the classical outcome of the 
measurement (see Section [3.2.31 in Chapter 3). 

Suppose two parties share a bipartite system AB in the state ^^^^ but have ac- 
cess only to a classical communication channel (i.e., a channel which transmits only 
classical data). The parties can send information by performing a finite number of 
rounds of local measurements (or other local processing such as instruments) and clas- 
sical communication of the outcomes between them. These types of operations are a 
special class of quantum operations known as LOCC (Local Operations and Classical 
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Communication). They can be written elegantly in the operator sum representation 
as: 

^(^^^) = ® Y,^)i,p^^){X^ ® Yfy. (2.10) 

i 

Note, however, that the class of operations which can be written in the previous form 
includes operations which are not in the LOCC class. Quantum operations satisfying 
eq. fl2.10p are called separable. Teleportation and distillation protocols are examples 
of tasks which are performed using LOCC operations. 

2.2.3 Distance measures 

Given two probability distributions p{x) and q{x) over the same index set X, the 
total variation distance between p{x) and q{x) is defined as 

D{p,q) = ^^^Ipix) -q{x)\, 

where |x| is the absolute value of x. The total variation distance D{p, q) is a metric: it 
is non-negative for any distributions p, q, it is symmetric in its arguments {D{p, q) = 
D{q,p)), and it satisfies the triangle inequality: 

D{p,q) < D{p,r) + D{r,q), 

where p, q, and r are arbitrary probability distributions over the same index set. 
The trace distance between two density operators and is given by 

nf A\ . ■'■ II ^A|| 

^(P ) ■= ^\\P - 111, 
with the trace norm ||X||i of an operator X defined as 

||X||i = TrVXtX. 

Here, the \fX function for a positive operator X is defined via the spectral decom- 
position of X: 

VX:=5^V^|e,)(e,|^. 

i 

If p"^ = '^iPi\i){i\^ and = Xlj (^1"^' easy to see that the trace dis- 

tance D{p^,a^) reduces to the total variation distance D{j>,q). The trace distance 
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D^p'^^a'^) extends the total variation distance by providing a measure of closeness 
for states which are not simultaneously diagonalizable. (Two states and 
which are simultaneously diagonalizable can be written as p^ = Xli ^^^1 
= /ij|ej) (cjl^ for a common set of eigenvectors {cj}.) Symmetry and non- 
negativity of the trace distance follows easily from the definition of the trace norm. 
The triangle inequality 

D(p^c^^)<D(p^0^)+D(0^c^^), 

is also satisfied for any state (p^. 

For any two orthogonal states p^ and a^, the trace distance is maximized and 
is equal to 1. The trace distance D(p^,cr^) is equal to zero if and only if the states 
are the same. The trace distance D{p^,a^) is invariant under unitary operations 
performed on p^ and a^: 

D{p^,a^) := D{Up^U\Ua^U^). 

The trace distance can only decrease under trace-preserving quantum operations (a 
property also known as monotonicity) : 

D{E{p^),£{a^))<D{p^,a^). (2.11) 

Another measure of closeness between two states p^ and is obtained via the 
fidelity gSlIlT]: 

F(p^,a^) = Try^Vp^o^Vp^- 
This can be re-expressed using the trace norm as 

F(p^a^) = ||v^v^||i. 

The fidelity between two states p^ and a"^ is equal to one if and only if the states are 
the same. It is always non-negative and is zero for any two orthogonal states and 
a^. It is also invariant under unitary operations performed on p^ and a"^: 

F(p^, a^) = F{Ua^U\ Ua^U^). 
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The trace distance is bounded by the fidehty (see Fuchs and van de Graaf |18] for a 
proof) in the following way: 



1 - F{p, a) < D{p, a)<^l- F\p, a). (2.12) 
The fidelity is increasing under trace-preserving quantum operations: 

An incredible theorem, known as Uhlmann's theorem, relates the fidelity F{p^, a^) 
to a maximization over purifications of p^ and a^: 

Theorem 2.2.1 (Ulhmann's theorem [IE]). Let p^ and be states of a system A. 
Introduce a second system R which is a "copy" of A. Then, 

F(p^a^) = max 1(^10)1, 

where the maximization is over all purifications lip)"^^' of p^ and \(f))^^' of a'^. 

A proof of this theorem can be found in [35]. We will use in the next chapter a 
very useful corollary to Ulhmann's theorem: 

Corollary 2.2.2. Let p"^ and cr^ be states of a system A. Introduce a second system 
R which is a "copy" of A. Then, 

F(p^a^)=max|(V;|0)|, 

m 

where 1?/;)^^ is any fixed purification of p^ , and the maximization is over all purifi- 
cations 10)"^^ of . 



Another important result we will frequently use is the Fannes inequality 
which bounds the difference in the von Neumann entropies of two states p^ and 
clS cl function of their trace distance. 

Lemma 2.2.3 (Fannes Inequality). Let p^ and be states on a d-dimensional 
Hilbert space A. Let e > be such that ||p^ — cr^^Hi < e. Then 



SiA), - SiA)^ 



< ?7(e) logd. 
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where ri{x) = x — xlogx for x < -. When x > we set rj^x) = x + 



2.2.4 Typicality 

Suppose an information source emits a sequence of letters X1X2X3 . . . taken from 
an alphabet X according to some probability distribution p{x). If the source is memo- 
ryless (i.e., each letter in the sequence is an i.i.d. random variable X with probability 
distribution p{x)), we expect the frequency at which each letter x will appear in the 
sequence to depend on the probability weight p{x) associated with the letter x. For a 
sequence x" of n letters X1X2X3 . . . x„, let N{x\x"') be the number of times the letter x 
appears in the sequence x". Given a probability distribution P{x) over the alphabet 
X, we say that the sequence := X1X2X3 ... x„ is of type P if A^(x|x") = nP{x) for 
every letter x. That is, the letter x appears exactly nP{x) times in the sequence x". 
For a number 6 > 0, the set 7^"^ of S-typical sequences of length n for the probabihty 
distribution p(x) is defined as 



Tj^^s := <! : Vx e X, 



— p[x] 



n 



< 6 



Typicality can be exploited to prove theoretical bounds on the achievable rates for 
discrete memoryless sources. (See, for example, the proofs of the source coding the- 
orem and the noisy channel coding theorem [39l |50].) For any e,S > 0, we have for 
sufficiently large values of n: 

P''{V,s) > (2.13) 

2-n{H{X)+c5) < ^n^^n-^ ^ 2-n(H{X)-c&) y^n ^ ^2.14) 

(1 — e)2"^"^'^'^''~'^''-' < IT"^! < 2^''^'''^^~^'^^\ (2-15) 

where p"(x") = nr=i^'(-^«) ^"^^ ^ some positive constant. We refer to [5l] for a 
proof of these statements. 

The classical notion of typicality extends to the quantum setting by considering a 

memoryless quantum source emitting a sequence of unknown states |xi)|x2)|x3) 

with known density operator tp"^ = '^^Px\x){x\^- The source transmits a state 
Ix)"^ with probability p{x). We can alternatively think of the source as emitting 
many copies of the density operator For n copies of the state its spectral 
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decomposition is written as: 

X" 

If we perform a measurement in the basis it follows from the classical notion 
of typicality that, for sufficiently large values of n, we will obtain with very high 
probability a measurement outcome belonging to the set of typical sequences T^^. 
For 5 > 0, we define the 6— typical subspace A^^^ for the density operator ipj^ as 



^5,5 :=span||x") 



The projector into the typical subspace A^^^ is given by: 

In later chapters, we will often abbreviate the typical projector 11^ ^ associated with 
the state ip'^ as 11^, where A is shorthand notation for the typical subspace A^ 

The probability of obtaining a measurement outcome x" G 7^"^ is equal to 
Tr(-?/;|'"n^^^), and since ^"(T^"^) > 1 — e for any 5, e > and sufficiently large n, 
we have 

Tr(^rn;,5) > 1 - e. (2.16) 

We also have the following properties (see Abeyesinghe et al. [26] for proofs of these 
facts), analogous to eqs. (12.140 and (12.150 . for any e, 5 > and sufficiently large 
values of n: 

2-n(5(A)^,+c5)j^n ^ < III ^^i)^^^ < 2-'^(5(^)*-^'5)n;^^ (2.17) 

(1 _ ^^2'^{s{A)4.-c5) ^ Tr(n;^^5) < 2"(^(^)*+^^) (2.18) 
Tr[*|] := Tr[(^^)2] < (i _ g)-22-n(5(A)^-3c5)^ ^2.19) 

where c is some constant and is the normalized state obtained after projecting 
the state ip^"' into the typical subspace ,5: 

■ Tr(n:;,_,v^r ) • 
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The last line is obtained by combining eqs. f l2.16p . ( I2.17P and fl2.18p with the fact that 
Tr[A^] < Tr[i?^] for any two positive operators such that A < B. Typical subspaces 
are a helpful tool when performing asymptotic analysis of quantum protocols (see, 
for example, [121 |2ll |26]). This is due, in part, to the following lemma: 



Lemma 2.2.4 (Gentle Measurement Lemma [52]). Let he a sub-normalized state 
(i.e p"^ > and Tr[p"^] < ly). For any operator < X < / such that Ti[Xp^] > 1 — e, 
we have 



^fXp^^-p' 



< 2v^. 



For a proof of the Gentle Measurement Lemma, see [52]. The better constant 
obtained above is from Ogawa and Nagaoka [53]. An application of the previous 
lemma combined with the triangle inequality yields 



Cgin 
A 



< 2v^ + e < 



for < e < 1. Projecting to the typical subspace preserves the information (up to 
an arbitrarily small loss) contained in the state t/;®" and allows the analysis of the 
asymptotic behavior of most information-processing tasks to become much simpler 
to perform. 

2.3 (Smooth) min- and max-entropies 

For a discrete random variable X taking values in a set {xi, X2, . . . , x„} with 
probability p{xi), the Renyi entropy [5l] of order a, where a 7^ 1, is defined as 



-logf^p(xi 
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When a — 1, we recover the Shannon entropy H{X) = — XliLi^'l^*) ^ogp{xi): 

d_ 

\mvHo,{X) = hm "'^ 



' log(Er=iP(^^)") 



liin Z]^*^^) Er=i^'(^i)"l^^P(^0 



n 



i=l 

The first hne is I'Hopital's rule and the third line follows from ^p(xj)" = p{xi)'^ Inp(xj). 
Taking the limit of Ha{X) as a — )■ oo, we obtain the classical rain-entropy: 

HrainiX) := lim Ha{X) = - logmaxp(xi). 

a—^oo i 

The Renyi entropies were introduced by Renyi in 1961 as alternatives to the Shannon 
entropy as measures of information. The Shannon entropy, viewed more abstractly, 
is also the unique function of a probability distribution which satisfies a precise set 
of postulates (see [M] for a detailed description of these postulates). Renyi extended 
the notion of entropy to more general random variables, often called "incomplete" 
by Renyi because their observation could occur with probability less than one. By 
generalizing the postulates characterizing the Shannon entropy, other information- 
theoretic quantities were obtained, such as the family of Renyi entropies. Applications 
of the Renyi entropies abound in areas such as cryptography [55], [56] and statistics 
[571 EH]. 

Quantum min- and max-entropies are adaptations of the classical Renyi entropies 
of order a when a — )• oo and a = 1/2 respectively. Let S<{AR) be the set of sub- 
normalized density operators (i.e Tr(p'^-^) < 1) acting on the space AR. The quantum 
min- entropy [59] of an operator p"^^ G S<{AR) relative to a density operator is 
given by 

H^Up^^'W'') :=-logA, 

where A is the minimum positive number such that A(/"^ ® cr^) — p"^^ is a positive 
operator. The conditional min-entropy H^i^{A\R)p is obtained by maximizing the 
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previous quantity over the set B{R) of density operators for the system R: 

H^^{A\R)p:= max i7^in(p^^|a^). 

For two sub-normahzed states p and p, we define the purified distance [60] between 
p and p as 



P(p,p) :=y'l-F(p,p)2, 
where F(p, p) is the generahzed fidehty between p and p: 

F(p,p) := F(p,p) + v/(l-Trp)(l-Trp). 

The purified distance is related to the trace distance D{p,p) := i||p — p||i as follows 



D{p, p) < P{p, -p) < 2^D{p,p). (2.20) 

A proof of this fact follows directly from Lemma 6 of [60]. (Lemma 6 actually relates 
the purified distance to the generalized distance -D(p, p). However, -D(p, p) is bounded 
above by 2D{p, p) and bounded below by -D(p, p).) 

Using the purified distance as our measure of closeness, we obtain the family of 
smooth min- entropies {H^:^J^A\R) p} by optimizing over all sub- normalized density 
operators close to p"^^ with respect to P{p,p): 

^min(^l^)p:= , max H^UA\R)p, 

where the maximization is taken over all p^^ such that P[p^^, p^^) < e. Given a 
purification p^^^ of p"^^, with purifying system 5, the family of smooth max-entropies 
{H^^,M\B)p} is defined as 

Hl,M\B)p ■■= -Hl^M\R)p (2.21) 

for any e > 0. When e = 0, an alternative expression for the max-entropy H^i,^{A\B) p 
was obtained by Koenig et al. [61] : 

H^UA\B)p= max logF2(p^«, ® a^), (2.22) 

uB(iB{B) 
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where the maximization is taken over all density operators on the space B. The 
smooth max-entropy can also be expressed as 

H^^^{A\B),= imn H^UA\B)„ (2.23) 

where the minimum is taken over all sub-normalized p^^ such that P[p^^ , p^^) < e. 
We refer to [60] for a proof of this fact. From eq. f l2.22p . the smooth max-entropy 
Hn-inAP^) of ^ sub-normalized operator p^ G S<{A) reduces to 

i/^,Jp^) = 21og5^v^, (2.24) 

X 

where fj, are the eigenvalues of the sub-normalized density operator p^ which opti- 
mizes the right hand side of eq. f l2.23p . 

The smooth min- and max-entropies are also known to satisfy other useful prop- 
erties such as quantum data processing inequalities and concavity of the max-entropy 
(see [HD])- These measures were introduced to characterize information-theoretic tasks 
which tolerate a small error on the desired outcome. In Chapter 4, we describe a pro- 
tocol for the task of multiparty state merging and analyze the entanglement cost 
using smooth min-entropies. 

We also need, for technical reasons (see eq. (14. 8 p found in Chapter 4), another 
entropic quantity called the conditional collision entropy [59] : 



i/2(p^''k^) :=-logTr 



where p^^ is a density operator for the system AB. It is a quantum adaptation of the 
classical conditional collision entropy. The following lemma, proven in [59], relates 
the quantum min-entropy to the collision entropy: 

Lemma 2.3.1. For density operators p^^ and with supp{TrA(p"^^)} C 

supp{(T'^}, we have 

The last two results we will need are the additivity of the min-entropy and the 
following lemma which relates the trace norm of an hermitian operator X to its 
Hilbert-Schmidt norm ||X||2 := A/Tr(XtX), with respect to a positive operator a: 
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Lemma 2.3.2. Let X be an hermitian operator acting on a space A and a be a 
positive operator on A. We have 



\X\\i < ^/f^\\a-^/^Xa-^^' 



4|| 

2- 



Proof Rewrite the right hand side as iyTr(o")Tr(Xcr ^/'^Xa ^/^) and apply Lemma 

5.1.3 of Eg. □ 



Lemma 2.3.3 (Additivity). Let p"^^ and p"^ ^ be sub-normalized density operators 
for the systems AB and A'B' respectively. For density operators and a^' , we have 

Additivity follows straightforwardly from the definition of the quantum min- 
entropy. 

2.4 Previous distillation protocols 

2.4.1 The Schmidt projection method 

One of the first protocols for extracting pure entanglement was devised by Ben- 
nett et al. [62] and works on a supply of partly entangled pure states with 
entropy of entanglement 

Ei^P^^) := S{A)^. 

First, let's assume the states being shared are qubits. The extension of the protocol 
to higher dimensional systems will be straightforward. Using the Schmidt decompo- 
sition, we can write the pure state ip^^ as: 

^^^ = cos(^)|eo/o)+sin(^)|ei/i), 

with < 6 < 7t/2 and sin^^ + cos 6'^ = 1. For the tensor product state iPab^ have 

« = (K) f cos(e)|e^/^) + sm{e)\e\ fl)] . (2.25) 



i=l 



By expanding the right hand side of eq. (12.251) , we get coefficients of the form 

Ai. := cos""'' 6* sin'' 61, 
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for < k < n. Let be the associated projector onto the subspace of dimension 

(«) spanned by the vectors {|e. e^, . . . e?.)} havmg coefficient A. in eq. & Alice 

performs a projective measurement with projectors {Pj^}, yielding the outcome k 
with probabihty 



= , ^j(cos2^)"-'-(sin'^)^ 

By virtue of the original entanglement, Bob will obtain the same value for k if he 
wants to perform his measurement. Hence, the measurement produces a maximally 

entangled state tp^^ in a I J -dimensional subspace of the original 2^*^ dimensional 



space. 

The efficiency of the above procedure can be understood as follows. The expected 
entropy of entanglement YlkP^^ii^k^) ^^e residual states is non increasing under 
local operations (see [38] for a proof). Hence, we must have 

Y,PkS{A)^, < EHjTb) = riEiij^'') = nS{A)^. (2.26) 

k 

The von Neumann entropy of the reduced state ip'^'^ can only increase under projective 
measurements: 

S{A)^^. = nS{A)^ < SiA)^^^„ (2.27) 
where ipo^^ ■= J2kPki^k^ ■ Since we have (see [35] for a proof) 

S{A)^^^,<H{p,) + J2PkSiA)^,, 

k 

we combine with eqs. (12.261) and (12.271) to obtain 

Y.P'^SiA)^, < ni?(^^^) < H{p,) + J2PkS{Ah,. 

k k 

But H{pk) is the entropy of a binomial distribution of n trials with success probability 
p = sin^ 9, which increases only logarithmically with n. Hence, as n -> oo, the 
expected entropy of entanglement converges to nE, and so the original entanglement 
is preserved. 

To convert the residual states into a standard form such as EPR pairs, however, 
we must use a double blocking strategy: partitioning the original input into different 
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subsets and applying the above procedure on each of those subsets. If we have m 
batches of tensor states '^/'l'^, we obtain a sequence of k values ki,k2,...,km by 
applying the previous strategy of projective measurements on each batch. To obtain 
a desired number of singlet states, we need to have an adequate amount of batches 
m at our disposal. Let 




be the product of the binomial combinations yj^j ^'^^ ^^^^ ^ batches and fix 
some e > 0. If Dm lies between 2' and 2'(1 + e), we have enough batches so that we 
can recover / singlets by projecting (Alice) the residual states ip^^ ® ip'^^ ■ ■ ■ ip^m o^^o 
a large space of dimension 2' (followed by some Pauli operators). Otherwise, we will 
need more supply of initial entanglement. 

With probability greater than 1 — e, the projection onto the large space will 
succeed and, by virtue of the entanglement of the residual states, Alice and Bob will 
now share / singlets. With probability less than e, the residual states are projected in 
a smaller subspace of dimension D^, — 2' < 2'e. In such failure is declared and 

the protocol is aborted. As discussed in [62], the converting into product of singlets 
can also be shown to preserve the original entanglement in the limit of large n. 

2.4.2 The hashing method 

The first distillation protocols working on a supply of mixed states ip^^ appeared 
in [38]. If Alice and Bob can communicate classical information, Bennett et al. con- 
structed a strategy, called the hashing method, for producing a non-zero yield of 
pure entanglement if the pairs are drawn from an ensemble of Bell states with known 
density operator: 

i/j"^^ = Po|*+)(*+| +P2|$-)($-| +P3|^-)(^-|, 

with von Neumann entropy S{AB)^ given by the Shannon entropy H(p) of the prob- 
ability distribution {^07^17^27^3}- In this section, we give a brief description of this 
method. Each of the four Bell states can be encoded using 2 classical bits in the 
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following way: 



00 



01 



11. 



10 



(2.28) 



An unknown sequence of n Bell states can then be represented as a bit string of 
length 2n. For instance, the sequence is encoded as 001110. The parity of 

a subset s of the bits in a string x is equal to the modulo-2 sum of the bitwise AND 
between s and x, or equivalently, the Boolean inner product s ■ x. For instance, if we 
have X = 001110 and s = 100100, then the parity of the selected subset 000100 of x 
is equal to 1. 

At the start of the protocol, Alice and Bob share an unknown sequence of n 
Bell states. Let xq be the bit sequence of length 2n corresponding to this unknown 
sequence of Bell states. The hashing method consists oin — m rounds of the following 
procedure: At the start of round A; + l, k = 0,1, . . . ,n — m — l, Alice chooses a random 
subset s of the unknown bit sequence Xk of length 2(n — k) and sends it to Bob. Alice 
and Bob then determine the parity of s by performing local operations on their share 
of the remaining n — k pairs. In [38], it is shown how to obtain the parity of a subset 
s by measuring a single pair of qubits, while preserving the Bell-state nature of the 
remaining pairs. The unmeasured pairs, represented by the classical string Xk+i, will 
change according to some deterministic hash function Xk+i = fsi^k)- At the end of 
the round, Alice and Bob share an unknown sequence Xk+i = fs{xk) oi n — k — 1 Bell 
states. 

For any two possible candidates y ^ z at any given round in the above protocol, 
the probability that they will agree on a random subset s is equal to 1/2. This can 
be seen from the fact that 



where © denotes addition modulo-2. Thus, after each parity measurement is per- 
formed, we can expect on average at least half of the remaining candidates to be 
eliminated. The remaining candidates will be mapped into a set of possible output 
sequences according to the hash function fs{xk)- After r rounds, the probability that 



{s-y)®{s- z) = s-{y®z) 
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two distinct candidates y and z remain distinct is therefore bounded by 2~^, com- 
bining the previous two facts. From typicahty, we know that with arbitrarily good 
probabihty, the unknown string x will be in a set of 2"(^(^-^)*+'5) typical sequences, 
where 6 > can be made arbitrarily small by choosing n large enough. Hence, if we 
perform approximately n — m = n{S{AB)^ + 26) rounds, we can expect to identify a 
single good candidate x with high probability. Failure occurs if after n — m rounds, 
we have more than one candidate or no candidate if we restrict our search within the 
set of typical sequences. 

If a single candidate is found after n — m = n{S{AB)^ + 26) rounds, Alice and 
Bob share n(l — {S{AB)^ + 26)) copies of impure Bell states which can be transformed 
into maximally entangled states by applying suitable Pauli operators: 

-(;;) -(::) -(;:') 

For instance, if the sequence of unknown pairs is |$+) |^_) !$_), Alice and Bob can 
recover three singlet states by applying the Pauli operators J® J on Alice's share 
and X ® I ®X on Bob's share. As the number of copies n grows larger, this strategy 
will produce a yield approaching 1 — S{AB)^ maximally entangled pairs per copy of 
the input state . 

A more recent protocol by Devetak and Winter [29], relying on different ideas, 
extends the previous result to arbitrary mixed entangled states and proves the 

following hashing inequality, named after the hashing method, for one-way distillable 
entanglement D^{ip^^) (see [29] for a formal definition): 

Lemma 2.4.1 (Hashing inequality [381 [29]). Let ip^^ he an arbitrary bipartite mixed 
state. Then, 

D^(^^^) > S{B)^ - S{AB)^ =: I{A)B)^. (2.29) 

The quantity on the right hand side is known as the coherent information from 
A to i? of the state ip^^. For the case of mixtures of Bell states, the coherent 
information evaluates to 1 — S{AB)^, which is exactly the yield attained by the 
hashing method. Devetak and Winter showed how to achieve coherent information 
for arbitrary states [29] by exploiting the connection between entanglement distillation 
and quantum data transmission, for which the coherent information had already been 
demonstrated to be achievable [631 EH |65] . 
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2.4.3 Entanglement of assistance 

Suppose a memoryless quantum source emits an unknown sequence of Bell states 
|\E'_),|\E'+),|$_),|\E'+),. . . with known density operator ^|:^^ and gives one qubit of each 
pair to a laboratory A (Alice), while the other halves are sent to another laboratory 
B (Bob). Suppose each Bell state is produced with equal probability. From Alice and 
Bob's point of view, they share many copies of a maximally mixed state I^^ /A. The 
hashing inequality suggests that it is impossible to distill entanglement for this state. 
If Alice and Bob have no additional information about the source, this assumption 
is correct (see [38] for a proof). If a third party (Charlie) has access to the source, 
however, and tells Alice and Bob the exact sequence of states produced, they can 
recover a singlet state for each pair of qubits shared by applying appropriate 
Pauli operators. 

An equivalent view of the previous example is to assume that Alice, Bob and 
Charlie share the state 



Il22...«n 



where each state ^E'jijj-. jn is a product of Bell states. Each index ij corresponds to 
one of the four Bell states using the encoding of eq. fl2.28p . If Charlie measures his 
system in the basis {\i1i2 ■ ■ ■ in)^}, and sends the measurement outcome to Alice and 
Bob, they can apply Pauli operators to recover n singlet states This is in sharp 
contrast with the results of the hashing method, which gives a zero yield for the 
reduced state ip^^. 

Motivated by this simple example, a natural question to ask is how much ad- 
ditional entanglement can be distilled from a tripartite state ip^^^ if third party 
assistance is available. This has been studied and solved completely [211 [221 [23 [21] 
when the parties share an arbitrary pure tripartite state ip^^'~^ . The entanglement of 
assistance [21] for the pure state ip^^^ is defined as: 

E^(V^^^) := i?^(V^^^^) := max^p,5(Ai?)^^, (2.30) 

i 

where the maximum is over all decompositions £ = {pi, ipf^} of ip^^ = Tic'ip'^^'" into 
a convex combination of pure states . By applying a POVM on the system C, the 
helper Charlie can effect any such pure state convex decomposition p^^ = Piipf^ 
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for Alice and Bob's state [66], and so the quantity Ea maximizes the average amount 
of entanglement that Alice and Bob can distill with help from Charlie. Since Ea is 
not, in general, additive under tensor products [2T], it will often be the case that 
collective measurements performed by Charlie on the joint state ip'^'^ will be more 
beneficial to Alice and Bob than individual measurements on individual copies of ijj'" . 

Define the quantity Ea as the optimal EPR rate distillable between Alice and 
Bob with the help of Charlie under LOCC quantum operations. This was shown by 
Smolin et al. [23] to be equal to the regularization of Ea- 

E^{^^^^) = hm -Ea{^Tbc)- (2-31) 

n— >oo 77, 

Furthermore, a nice simple expression for E'^ in terms of entropic quantities was also 
obtained: 

= mm{S{A)^, S{B)^,}. (2.32) 

The proof of this result is based on a variant of the Holevo-Schumacher- Westmoreland 
(HSW) theorem [671 [68]^. and makes use of random coding in a very original way. 
Write the state ipABC in Schmidt form as J2j consider n copies of 

it: 

where {tpj)^"^" = \'ipji)^"^"\'ipj2)^"^" ■ ■ ■ \'4^jn)^"^" ■ After projecting the system C" 
into a subspace of constant type P, Charlie selects a random code JT" = ( J^, J^, . . . , J^), 
where each J* is a sequence jij2 ■ ■ - in of type P, by performing an appropriate POVM 
{^\tj{oi)){tj{a)\}j^a on his system, where 

and c is a constant chosen so that '^ja^\tj{a)){tj{a)\ = Hp, where Hp is the 
projector onto the type P subspace of C". The HSW theorem guarantees that if the 
number of codewords is a bit less than 2^'^, where x is the Holevo information 

i 

^Thc version of the HSW theorem relevant for proving eq. p.32p appears in [29] . 
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of the channel i — ipf (assuming w.l.o.g. that S{A)^ < S{B)^), then the code will 
be good. That is, with very high probability, both Alice and Bob have good decoders 
for their respective channels i — )■ ipf and i — > tpf. Once Alice and Bob know the code 
selected, they each apply the decoders associated with this code in a coherent way: 
if {D^}^^i is a decoder for Alice, she applies the isometry 

m 

to her system and similarly for Bob. This will produce residual states z/^^'^^' such 
that 

m=l 

where e-27r«am/Af phases introduced by Charlie's POVM. The entropy of entangle- 
ment S{AA'),^ of the residual states can be shown to be arbitrarily close to the rate 
of eq. f l2.32p . A standard distillation protocol, within in a double blocking scheme, 
can then be applied to recover EPR pairs at this rate. 

A simpler proof of eq. fl2.32p appears in [21] and uses state merging to reveal a 
property at the core of third party assisted distillation: If a third party holds the 
purifying system C of a bipartite state ip^^, then he can always transfer his system 
to either Alice or Bob, whichever will result in the least bipartite entanglement. Both 
eqs. f l2.3ip and f l2.32p will also follow from our more general Theorems 15 . 2 . 5 1 and 15.3.2] 
in Chapter 5. 

The previous scenario can be generalized to include multiple helpers who will 
assist Alice and Bob in distilling entanglement. In the multipartite entanglement of 
assistance problem |23] , the task is to distill EPR pairs from a (m + 2)-partite pure 
state ^c'lCa-.-CmAB giiared between two recipients (Alice and Bob) and m other helpers 
Ci, C2, . . . , Cm- If many copies of the input state are available, the optimal EPR rate 
was shown in [24] to be equal to 



^oo(^c.c....c„AB) ^i^siAT)^ =: Emm-cutii^'^''''-''-^''), (2.33) 

where T C {Ci, C2, ■ ■ ■ , Cm} is a subset (i.e a bipartite cut) of the helpers. We denote 
the complement by T := {C1C2 ■ ■ ■ Cm} \ T. We call min7-{S'(y4T)^,} the minimum 
cut entanglement (min-cut entanglement) of the state ipCiC2...Cm.AB ^ ^-j^ recover 
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eq. f l2.33p from our more general Theorem I5.4.4[ 
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Multiparty State Transfer 

3.1 Introduction 

Suppose two parties share n copies of a bipartite mixed state ijj^^ and one of 
the parties (Ahce) wants to transfer her system to the other party (Bob) using 
as httle quantum communication as possible. Consider a purification ip^^^ of this 
state, where R is the purifying (reference) system. An appropriate measure for the 
correlation between Alice's system and the purification system is the quantum mutual 
information: 

/(A; R)^ := S{A)^ + S{R)^ - S{AR)^. 

For the systems A" and i?", we have I{A^\ R^)^s,n = nI{A] R)^ bits of correlation. 
Superdense coding suggests a strategy for transferring the system A" to Bob: encode 
nI{A; R)^ bits of mutual information in a subsystem Ai of dimension ^I{A; R)^ and 
send this subsystem to Bob. This will transfer the correlation between Alice and the 
reference to Bob, leaving Alice with a system A2 decoupled (i.e decorrelated) from the 
reference R. Using his knowledge of the density operator Bob can then recover 
the entire system A" via local operations on the systems AiB"'. 

Abeyesinghe et al. [2B] showed that by applying a random (Haar distributed) 
unitary U : A A^A^ to a subspace A of A", transmission rates arbitrarily close to 
\l{^A\ R)^ are achievable. When Bob receives the Ai system, he holds a purification of 
the system A2-R", which can be taken, by means of an isometry V : AiB^^ BB^\ to 
the original state iPabr with high fidelity. The decoding also distills entanglement, in 
the form of maximally entangled states shared between the sender and the receiver, at 
an ebit rate approaching ^I{A; B)^. This is known as the fully quantum Slepian- Wolf 
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(FQSW) protocol. 

If a quantum channel is not available between Alice and Bob, but they share 
enough entanglement, they can substitute quantum data transmission with telepor- 
tation to achieve the previous task. This is known as quantum state merging and is 
the original formulation of the state transfer problem. The main result obtained by 
Horodecki et al. [251 El] is a proof of the existence of protocols achieving near optimal 
ebit rates, arbitrarily close to the conditional von Neumann entropy S{A\B)^. This 
result is easily derived by modifying an FQSW protocol: teleport the qubits using 
\l{A] R)^ ebits and recycle the distilled entanglement at the end of the protocol. In 
the limit of many copies of the state only a net rate of 

h{A;R)^-h{A;B)^ = S{A\B)^ 

ebits is needed to transmit the system A" with high fidelity. If the conditional entropy 
is negative, the protocol returns ebits for future communication, but still requires 
catalytic entanglement to achieve the transfer. 

Horodecki et al., however, took a more direct approach, similar to what is found in 
Abeyesinghe et al. [26], by considering random (Haar distributed) measurements on 
the A"^ system. The benefit of this approach is to remove the catalytic number of ebits 
needed when adapting an FQSW protocol to perform state merging. In other words, 
when the conditional von Neumann entropy is negative, the existence of protocols 
achieving a state transfer by LOCC only (no injected entanglement) was shown. This 
result has important consequences in the context of entanglement distillation. For 
positive coherent information, there exist state merging protocols which are also one- 
way entanglement distillation protocols. By preprocessing Alice's system to optimize 
the coherent information as much as possible, and applying a state merging protocol, 
near optimal rates are achievable (see Theorem 13 of Devetak and Winter [29]). 

In this chapter, we analyze two extensions of the state transfer problem. First, we 
consider m senders and a decoder /receiver sharing a state ip'^^'^^-.-CmBR^ j^^j^ 
the number of ebits needed for transferring the systems Ci, C2, . . . , Cm to the receiver, 
sometimes equivalently referred to as "merging" the state ip'^i'^'i-'-CmBR ^^le receiver. 
This extension of the state merging task to a more general multiparty setting was 
also analyzed by Horodecki et al. [21] under the name of distributed compression. A 
combination of state merging protocols, initiated by each of the m senders, was shown 
to yield optimal ebit rates for distributed compression. For instance (see Figure [3?T|) . 
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Figure 3.1: Quantum circuit representing a distributed compression protocol involving 
two senders. Solid black lines indicate quantum information and dashed lines 
classical information. The protocol depicted involves the combination of two 
state merging protocols for recovering the systems Cf " and C®" at the re- 
ceiver's location. 

if many copies of a state ip'^^^'^^ are distributed to two parties Ci and C2, the sender 
Ci can first transfer his system to the receiver at a compression rate arbitrarily close 
to the entropy S{Ci)^. The second sender follows by merging his system with the 
receiver at an ebit rate approaching S{C2\Ci)^. If the second sender goes first instead, 
rates close to S{C2)^ and S{Ci\C2) achieved instead. 

These distributed compression protocols, although optimal in the rates, require 
the use of time-sharing for achieving rates which are not corner points of the rate 
region. Time-sharing consists of partitioning a large supply of states and applying 
different protocols to each subset. If a single copy of the state is available, this 
approach becomes impossible. We remedy this problem by showing the existence of 
multiparty merging protocols which work even if the parties share a single copy of the 
input state. That is, they don't require the use of time-sharing for performing the task 
of multiparty merging. This entails the existence of decoders of a more general form 
(see Figure [321) • A side-effect of time-sharing for the distributed compression protocol 
is to restrict the decoding implemented by the receiver to have a composition form 
f/i Ui , . . . f/,-, . A more general form ?7,-, for the decoder allows the distribution 
of the pre-shared entanglement to be different, while achieving the same rates. To 
illustrate this fact, we construct a specific example where our multiparty merging 
protocol allows a different distribution of the catalytic entanglement required for 
merging the state, compared to a distributed compression protocol as discussed in |24j . 
We prove that time-sharing is not required for the case of distributed compression 
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Figure 3.2: Quantum circuit for a multiparty state merging protocol involving three 
senders. The senders perform simultaneous measurements and send their 
measurement outcomes to the receiver. Upon reception, the receiver applies 
a unitary to recover the sender's systems with good fidelity. In the situation 
depicted above, there is also additional entanglement distilled between the 
receiver and the sender C3. 



involving two senders. For the more general task of multiparty state merging of m 
senders with side information at the receiver, we need to extend some of the well- 
known properties of typicality to the multiparty setting in order to show that time- 
sharing is not required. We discuss some of the difficulties in proving such results. In 
the next chapter, we characterize the entanglement cost of multiparty merging when 
a single copy is available to the parties by using the relevant entropic quantities for 
this regime. 

The second part of this chapter describes a slightly more general kind of state 
transfer. We introduce a second receiver A, and for a bipartite partition T C 
{1,2, ...,m} of the senders, we consider the problem of redistributing the state 
^CiC2-..CmABR receivers: The systems which are part of T are sent to 

the receiver A while the systems which are part of T go to the receiver B. We call 
this task a split-transfer for the state ipCiC2...Cm,ABR^ -^^^ important use in the 
context of assisted distillation: we show how to distill entanglement at the min-cut 
entanglement rate of eq. fl2.33p by combining a split-transfer protocol with a standard 
distillation protocol. 

Analyzing multiparty protocols requires the labeling of numerous systems, di- 
mension quantities, measurement outcomes, etc. . . To avoid losing the reader with 
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cumbersome notation, we make the following abbreviations: For a protocol involving 
m senders, we denote by Cm the composite system C1C2 ■ ■ ■ Cm- The capital letter 
M, written as a subscript next to a label, will always denote the composition of m 
objects. For instance, if each of the m senders have an extra system C^, the label 
C^j denotes the composite system C^C^ . . . C^- The symbol T will denote either a 
subset of the m senders or the composite system ^i^q-Ci. It will usually be clear 
which definition applies from the context. The complement of the set T is denoted 
by T, and may also denote the composite system (3)jGr^«- 

3.2 Multiparty state merging 

3.2.1 Definitions and main theorem 

Let A™ : CmC^/ ® BB^j Cjj O B\jBBm be an LOCC quantum channel 
implemented by m senders Ci, C2, . . . , Cm and a decoder /receiver B. Initially, each 
sender has a system Ci and also an ancilla : an extra system of dimension Ki := 
dco. The receiver also has ancillas B^.^, with d^o = dco, and with d^i = dci. The 
systems and B\,j are in the maximally entangled state ® ® . . . ® (^^"^ , 
with the state consisting of log(i^j) ebits shared between the sender and the 
receiver B. After applying the channel, the senders have subsystems C\,C\, . . .Cm 
of Cmi and the receiver holds three systems: B, B\i and i?jv/, with Bm being an 
ancillary system of dimension dsf^j = dcj,j- 

This channel implements multiparty state merging (see Figure 13. 3p for the state 
^CmBR If ^Yie output state (id/j ® A'T^){ip'^^'^^ (g) ^^^') is approximately a tensor 
product of the initial state ip^^BR ^^^^^ maximally entangled states ® ® . . . ® 
shared between the senders and the decoder. Each state is log(Lj) ebits 
shared between the sender Ci and the receiver. In more formal terms, we have the 
following definition: 

Definition 3.2.1 (m-Party Quantum State Merging). Let A™ he defined as in the 
previous paragraphs. We say that A™ is an m-party state merging protocol for the 
state ip'^^BR yjj^if^ error e and entanglement cost ^ := (logi^i — log Li, log J'r2 — 



logL2, ...,\0gKm- 



\0g Lm) if 



,CmBR 



.BmBR 



1 
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Figure 3.3: Picture of the initial and final steps of a multiparty state merging protocol 
involving three senders. The jagged lines indicate maximally entangled pairs 
shared between the receiver and the senders. The solid lines indicate corre- 
lation between the parties and the reference. At the end of the protocol, the 
systems Ci,C2 and C3 are transferred to the receiver. 



where the state iP^mBR corresponds to the initial state i^'-^mBR ^j^^j^ system Bm 
substituted for Cm- If we are given n copies of the same state, ip = (a)^"', the 
entanglement rate ^(cr) is defined as ^(cr) := ^^{ip). 

Before stating the main theorem, we need to define what it means for a rate-tuple 
to be achievable for multiparty merging using LOCC operations. 

Definition 3.2.2 (The Rate Region). We say that the rate-tuple ^ := R2, . . . , Rm) 

is achievable for multiparty merging of the state iP^mBR j^^. ^ > {], we can find 
an N{e) such that for every n > N{e) there exists an m-party state merging protocol 
A™^ acting on ^/;®" g) with error e and entanglement rate approaching We 
call the closure of the set of achievable rate-tuples the rate region. 

For the task of distributed compression (i.e., no side information at the decoder), 
the rate region was characterized in [21] by the inequalities 

Ri > S{T\T)^ for all nonempty subsets T C {1,2,..., m}. (3.1) 

If a rate Ri is negative for an achievable rate-tuple (-Ri, R2, ■ ■ ■ , Rm), close to —nRi 
ebits shared between the sender Cj and the receiver are also returned by the protocol. 
Allowing the receiver to have side information B as well leads to a similar set of 
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equations describing the rate region associated with the task of multiparty state 
merging. 

Theorem 3.2.3 (m-Party Quantum State Merging [M]). Let %1)^mBR ^ pure state 
shared between m senders Ci, C2, . . . , Cm o,nd a receiver B, with purifying system R. 
Then, the rate-tuple ^ := (i?i,-R2, • • • , -Rm) is part of the rate region for multiparty 
merging if and only if the inequality 

(3-2) 

holds for all non empty subsets T C {1, 2, m}. 

The theorem was proved in [2lj by showing that the corner points of the region 
are achievable and then using time-sharing to interpolate between them. We will 
extend the previous result to the one-shot setting in Chapter 4. In Section I3.2.4[ we 
show that for distributed compression of two senders, time-sharing is not required for 
transferring the systems and C2 at any rate satisfying the inequalities of eq. (13. ip . 

3.2.2 The "fully" decoupled approach 

The distributed compression protocol described in [21] achieves multiparty merg- 
ing by transferring the systems one at a time using the two-party state merging 
protocol. This has the consequence of decoupling the senders from the reference only 
at the end of the protocol. To better illustrate this, suppose the first i — 1 senders 
have transferred their systems to the receiver. The original state i^'^mBR ^j^jg pQjj-^^ 
in the protocol can be written as ip'^''^^, where i? is a system of the same dimension 
as the composite system BC1C2 ■ ■ ■ Cj_i and R := • • • CmR is the relative refer- 
ence with respect to the sender Cj. To transfer the system Ci, the sender performs 
an incomplete measurement on his composite system ® Ci, destroying most of the 
correlation with the relative reference R: 

i/^^ ^ t'^' ® ij^, (3.3) 

where ibf' ^ is the reduced state of ibf' for an outcome ji. After this measurement, 
the senders are not entirely decoupled with the reference as correlation may exist 
between the senders Cj+i, Ci_|_2, . . . , Cm and the reference. Since ^ip^^^^ purifies 
the state on the right hand side of eq. (13.31) . with Bi being a system of the same 
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Figure 3.4: The distributed compression protocol of [2l] for the case of three senders. 

Systems part of a relative reference are found within the dotted region. For 
this scenario, we assume the senders consume just enough entanglement to 
transfer their systems. No maximally entangled states are returned by the 
protocol. 

dimension as Cj, there exists an isometry (Corollary I2.2.2p V : BB^ — ?■ BiBB} 
implementable by the receiver which allows the recovery of the system Cj and log(Lj) 
ebits. The protocol continues in this fashion, with Cj+i being the next sender to 
measure his systems (see Figure 13.41) . 

To achieve multiparty state merging when the senders simultaneously measure 
their systems, as in Figure 13. 2[ we require the measurements to "fully" decouple each 
sender from the reference and all other senders. More precisely, suppose each of 
the senders Ci, C2, . . . , Cm performs an incomplete measurement, described by Kraus 
operators P/ : C°Cj — )■ C] mapping C^Ci to a subspace C}. Then, the reduced 
post-measurement states ipj^^^, where Jm '■= {ji, j2, ■ ■ ■ , jm) are the measurement 
outcomes, must satisfy the stronger requirement that for all outcomes Jm 
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where t'"^ is a maximally mixed state of dimension Lj. 

Ft 

Let's consider the case where the state ■i/'j^^ is exactly in the product form 
t'^m (g) The state "i/'j"^*^^^ purifies r*"" (g) ip^^ with purification systems B^jB. 
Another purification of T'~^^^i®ijj^ is also given by (^^^'^ijj^MBR^ where the state ip^^BR 
corresponds to the original state iP^mBB.^ follows from the Schmidt decomposition 
(Theorem I2.1.2P that these two purifications are related by a partial isometry Uj^^j : 
B^^B — > B\jBBm on the receiver's side such that 

(/^Mi^ ® Uj,,)ijff"'''''\I^"^''' ® UjJ = ® 

Hence, if each sender can perfectly decouple his system from the other senders and the 
reference, the sender's systems can be recovered at the receiver's location by applying 
a partial isometry f/j^^ on the systems of the receiver, which will also distill log(Ljv/) 
ebits. 

The previous scenario was ideal, and in general, will not be feasible for most 
states iP'^mBR^ Hence, we relax our decoupling requirement and accept that the 
measurements performed by the senders will perturb the reference up to some 
tolerable disturbance, and that a small dose of correlation between the senders' shares 
might still be present. In more formal terms, a multiparty state merging protocol 
consists of the following steps (also depicted in Figure I3l2l) : 

1. Each sender Ci applies a quantum instrument Xj := to his share of the 
state iP'^mBR ^ _ The instrument components map the space CjCf to a 
subspace C} of dimension Lj. 

2. The senders Ci, C2, . . . , Cm send their classical outputs Jm '■= {ji,j2, ■ ■ ■ ,jm) to 
the decoder B. 

3. The decoder uses his side information ip^, his share of the maximally entangled 
states ^-^^-f, and the classical information Jm to perform a decoding operation 
Vjj^j- : BB'Ij — 7- B\jBBm (i-e a trace-preserving completely positive map (TP- 
CPM)) and recover the state i^'^mBR ^ ^Lm_ 

The state of the systems CljBl^BR after steps 1 and 2 are performed can be written 
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as: 



Jm 

where £j., := £} ^ ® . . . ® S"^ and itR'^'^^'^^ is the normahzed version of the 
state (id'^"^-^^ (g) £j^,,){ip^''''^^^ ® $^^^^). If we restrict the operators £] to consist of 
only one Kraus operator 

Eiip) = Aip{A))^ foralHJ 

and to satisfy 



the outcome states '^^^^^^'^^ are pure and are the result of performing m incomplete 
measurements, one for each sender Cj. 

Proposition 3.2.4 (Compare to Proposition 4 of [21]). Let ipj^l be defined as 

in eq. jS.^^ , with reduced density operator ijjj^l . Define the following quantity: 

where pjj^j is the probability of obtaining the state i^j^^^'''^^ after all the senders have 
performed their instruments. If Qx{ip^^'^^ ®^^'^') < e, then there exists an m-party 
state merging protocol for the state ip^^BR ^j^^j^ error l^fe and entanglement cost 
^ = (logi^i - logLi, logii'2 - log/^2, log i^rn - logl^m), whcrc Li := dfji for all 
1 < i < m. 

Proof The proof of the above statement is very similar to the proof of Proposition 
4 in [21] . We give the full proof here for completeness. Using the relation between 
trace distance and fidelity (see eq. fl2.12p ). we have 
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From Corollary 12.2.21 of Ulhmann's theorem, there exists a partial isometry (i.e., a 
decoding) Uj^^j : B^j^jB — BI^BBm implementable by the receiver such that 

Using the concavity of F (see [35] for a proof) in its first argument, we have 

>1--, 

2' 

where 



is the output state of the protocol. Using the relation between fidehty and trace 
distance once more, we arrive at 



< 2v/e-eV4 < 2^e. 



□ 



3.2.3 Merging by random measurements 

In this section, we give a bound on the decoupling error when a measurement- 
based random coding strategy is used to achieve a multiparty state merging of the 
state iP^mBB.^ The senders Cj will simultaneously measure their systems CjC° using 

~ \P^L~^\ projectors of rank Lj, and a little remainder, followed by a unitary 
mapping the outcome state to a subspace C}. 

Proposition 3.2.5 (One-Shot Multiparty State Merging). Let ^c-a/B^? ^ (^Km ^ 
multipartite state shared between m senders and a receiver B. For each sender Ci, 
there exists an instrument Xj = consisting of Ni := [ J partial isometrics 

of rank Li and one of rank L[ = dc\Ki — NiLi < Li such that the overall decoupling 
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error Qxi'ip'"^'^^ ® <|)^a/^ bounded by 



qt<2 n 

rc{l,2,...,m} ieT 



dc^K, 



\ 



di 



E n 

rc{l,2,...,m} iST 



Tr 



(3.5) 



and there is a merging protocol with error at most 2y/ Aj. 

To prove this proposition, we show that the average decouphng error when the 
senders perform random instruments using the Haar measure is bounded from above 
by the right hand side of eq. (13. 5p . We will need the following technical lemma, which 
generalizes Lemma 6 in j2l] to the case of m senders. The proof follows a similar line 
of reasoning. 

Lemma 3.2.6 (Compare to Lemma 6 in [24]). For each sender Ci, let Qi be a 

projector of dimension Li onto a subspace C} of Ci and Ui a unitary acting on Ci . 
Define the sub-normalized density operator 



where Um '■= Ui 



U(Ci) JV(C2) 



Um.- We have 



< 



{Um 
Lm 



d. 



Cm 



dU 



M 



d. 



Cm 



d 



R 



rc{l,2,...,m} ier 

r^0 



2 

RT 



(3.6) 



where the average is taken over the unitary groups U(Ci), U(C2), . . . , U(Cm,) using the 
Haar measure. Here dUM '■= dUidU2 ■ ■ ■ dUm and J^/(^.-. dUi = 1 for all 1 < i < 



m. 



Proof For the remainder of this proof, write the integral /u^^^-j /^(Cj) ' ' ' /u(c ) dUM 
as J^^^^^^dUM- First, we simplify the integral using Lemma IA.3.3[ found in Ap- 
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pendix A: 



dU, 



V(Cm) 



Tr 



U(Cm) 



V{Cm) 



dU M — Tr 



dU, 



M 



CO^'^'''{UM)dUM 



V{Cm) 



(3.7) 



To evaluate the average of Tr[a;?,i ^(f^M)], we use the swap "trick" (Lemma IA.2.3P 



Tt[uI.JUm)] = Tr 



Tr 



The second hne was obtained using Lemma IA.2.41 By expanding the right hand side 
of this equahty, the average of Tr[a;?,i p(t/Af)] becomes equal to: 



Tr 



U(Cm) 



dU, 



M 



Tr 



.C\,R 



r^RR 



dU. 



M 



Tr 



U(Ca/) 

Tr 



U(Ca/) 



(3.8) 



where the unitary Um is a "copy" of Um which acts on Cm. Observe that the pro- 
jections Qi,Q2, ■ ■ ■ , Qm from the state ui'^m^ were absorbed by the swap operators: 



^c}c} 



Qi ® QiF^'^'Qi ® Qi. 



The average Jui^Cm)^^^^ ® U m^ F^^m'^^m {JJ m ® UM)dUM is expanded using Lemma 
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{UM®UM)\F''MCM){u,,®UM)dUM = (X) / {U,®U,)dU,. 

U(Cm) j=l Jv{Cm) 

(3.9) 

Using Proposition IA.3.71 we have 



{Ui ® Ui)^F^"^"{Ui ® Ui)dUi = rj'^^^^ + SiF''^^^ 



U(Ca/) 



(3.10) 



where the coefficients and Si are given by 

Li{dci — Li) Li 



< 



dcMl^-i) - dl 



Ci 



_ L,{Ldc. - 1) . Lj 



dcM 



(3.11) 



Substituting eqs. (jSS]), ( KTO\i into eq. ([32]), we get 



Tr 



U(Cm) 



dUKf = Tr 



J2 ri'^ n^ -^^ 

TC{l,2,...,m} i^T ieT 



(3.12) 



where the symbol T appearing in ip^-^ denotes the composite system ^i^-j-Ci- When 
T is the empty set, the second hne of the previous equation reduces to YULi '"jTrf?/;^]. 
From eq. ( 13.1ip . we can bound this quantity from above by: 



n^^Tr[V;^]<-^Tr[V^^ 



1=1 



R\ 



Cm 



Tr 
Tr 
Tr 



^2 



R 



a, 



Cm 



u''^'\UM)dUM 



U(Cm) 
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Hence, using eqs. (13. 7p . (13. lip . (I3.12p and the previous bound, we have 



/ 



U(Ca/) 



■Cm 



2 rc{l,2,...,m} i^r i&T '^^ 

T0 



(3.13) 



To obtain a bound on the average of eq. (13.61) . we use Lemma lA. 1.11 



/ 

Jv 



U(Cm) 



dcM 

< LudR 



dU, 



M 



U{Cm) 



M 



rc{i,2,...,m} j^r ier 



< L 



2 du 



rc{l,2,...,m} ier 

r^0 



Finally, using the concavity of the square root function, we have 



Cm 



d. 



Cm 



\ 



d 



R 



rc{l,2,...,m} ieT 



□ 

Proof of Proposition 13.2.51 For each sender Cj, fix Ni := [ '^'^^' J orthogonal 
subspaces Wl,W^, . . . ,Wl^' of dimensions Lj and one subspace Wj^ of dimension 
L'^ = dc^Ki — NiLi < Li. For each subspace W- , let V^"* be an isometry from the 
subspace W- to a subspace C^?^ of dimension Lj. Let := ^''-P/ be partial isometrics, 
where P- is the projector onto the subspace W- . Note that (5° maps to a subspace 
of Cl of dimension L'^ < Lj. Choose m unitaries Ui,U2, ■ ■ ■ ,Um using the Haar 
distribution, with Ui acting on CfCi. Set the instrument Xj for the sender Ci to have 
components ^/(p) := {QlUi) piQiUi^ for < j < iV^ (See Figure [331). 
Define the state 



64 



Chapter 3 




Figure 3.5: Quantum circuit of the inner workings of an instrument X3 performed by the 
sender C3 as described in the proof of Proposition I3.2.5[ 



For an outcome Jm = (ji, j2, ■ ■ ■ ,jm), with ji 7^ for all 1 < i < m, the trace norm 



R 



is bounded from above on average by: 



A^i N2 N„ 



dcM^M 



M 



i=l 



\ 



rc{l,2,...,m} iST 

r^0 



2 

RT 



< 



Li 



rc{l,2,...,m} iST 

r^0 



RT 



(3.14) 



where we have used Lemma 13.2.61 to get the bound, using the fact that 



dcM^M 
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where Vj^j := V^^ ® ® . . . ® is the tensor product of isometries mapping the 
projected subspaces W(^, W^^ , . . . , to Clj, and the state uY^^ is equal to: 

'^Sf (t^A/) ■= {PtUl PtUm ® ® T^"' {Pi'Ui ® . . . ® PtUm ® I'^Y ■ 

Taking the normahsation into account, with Pjj^j{Um) = Tr((X'j"^(?7jvf )) and ip'^^'^ — 



PJ 



LUj^l (Um), we need to show that on average the pj^iiUu) are close to 



Looking at eq. (13.141) and tracing out, we get 



N„ 



M 



dcM^M 



dUM < 



d. 



rc{l,2,...,m} ieT 

r^0 



Hence we obtain, using the triangle inequality, 



N„ 



M, 



dU 



M 



< 2 



df 



E 



rc{l,2,...,m} ieT 



2 

RT 



--: r 



Lastly, we need to consider what happens when at least one sender Ci obtains a 
measurement outcome jj equal to 0. For an outcome Jm, define the subset T{Jm) C 
{1, 2, ...m} such that i G T{Jm) if and only if jj = 0. Also, define the set Z = {Jm '■ 
\T{Jm)\ > 0}. Then, it is easy to show that the cardinality of the set Z is 

1^1= E II^- 

TC{l,2,...,m} i^T 
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cl,R, 



For an outcome Jm G the average probability of the state ujj^ {Um) is given by 
f pj,,{UM)dUM = TT [ Ujff\UM)dUM 

Jv(Ca,) Jv(CAr) 



Tr 



Q°r^'^'(Q°)^ (g) gfr^'^?(Qf)t 



With this formula in hand, and the fact that the trace norm between two states is 
at most two, we recover the left hand side of eq. (13. 5p for the average value of the 
decoupling error Qx{^'~^^'^^ ® ^^^^^): 



Nl N2 Nrr 



n -1*1 J*^ -I'm 



dU 



M 



< 



< 



< 



2 E 



nier-t-;n.^r^-^. 



rc{l,2,...,m.} 

2 E n 

rc{l,2,...,m.} ier 

2 E n 

rc{1.2,...,m} ieT 

r^0 



dc^K, 



+ r 



+ r 



(3.15) 



Hence, there exist instruments Xi,X2, . . . ,Xm implementable by the senders with de- 
coupling error at most the right hand side of this inequality. From Proposition I3.2.4[ 
we recover the second statement of the proposition, and so, we are done. □ 

3.2.4 Asymptotic analysis 

In this section, we analyze the case where the parties have at their disposal 
arbitrarily many copies of the state ip^^i^^. Theorem KT^ was proved in [23] by 
relying on a time-sharing strategy. In this section, we give a proof that our protocol 
requires no time-sharing for the special case of distributed compression involving two 
senders. We then discuss the main difficulty when attempting to generalize our proof 
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technique for the general task of multiparty state merging. We also give examples 
to illustrate the benefits of our protocol over the distributed compression protocol 
of 



Proof of Theorem 13.2.31 for two senders 

To prove the direct statement of the theorem for two senders and no side in- 
formation at the receiver's location, we use Proposition 13.2.51 in combination with 
Schumacher compression [12]. For n copies of the state ip^^'^'^R^ consider the Schu- 
macher compressed state 

\^f.c,R (n^^ ® n^^ ® u^){\^f^^^^r-, (3.16) 

and its normalized version |vl/\CiC2R ._ _^^\q\CiC2R_ ^jj^g systems Ci.Co and R 
are the (5-typical subspaces of C", and _R". The projectors onto these subspaces 
are denoted by n^^^, U^^ and 11^. For any e > and n large enough, we have 

TrlV^rn^) > 1 - e, Tr(^g;^n^J > 1 - e Trl^^a^n^J > 1 - e. 

Using Lemma IA.4.1t which generalizes the "union" bound of Abeyesinghe et al. [26] 
to an arbitrary number of typical projectors, we can bound the norm of Q'-''^^'^^: 

= {^pruc^ ® n^^ ® n^lV^)®" > i - 3e. (3.17) 

The properties for the typical projectors H,^^, 11^^ and 11^ allow us to tightly bound 
the various dimensions and purities appearing in Proposition 13.2.51 by appropriate 
"entropic" formulas. In particular, we have (see Chapter 2) for n large enough and 
any single system F = Ci, C2, -R: 

Tr[^|] < (1 - e)-22-«{s{^)^-35)^ 
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where 5 > is a typicality parameter. Let ^ = {Ri, R2) be any rate-tuple which 
satisfies the inequalities 



-Rl > S{Cl\C2)tl; 

Rl + R2 > S{CiC2)^. 



(3.19) 



We construct a family of multiparty merging protocols on the state vl/CiCaR -with 
vanishing error as follows: If -Rj < 0, the sender Ci performs a random instrument 
using projectors of rank Lj := [2~"^'J (and possibly one of rank L'. < Li). No 
maximally entangled state is shared with the receiver (i.e Ki := 1). If R^ > 0, 
the sender Ci shares a maximally entangled state of rank := [2'"'^'] with 
the receiver and performs a random instrument with rank one projectors {Li := 1). 
From Prop osition 13.2.5] the average decoupling error Qx{'^'"^'"'^ ^ ®$^^) is then 
bounded from above by 



U(Ci) JU{C2) 



< 



2 E n 



Li 



TC{1,2} ieT 

r^0 



+ 2 



rc{i,2} ieT 
T^0 



$2 _ 

RT 



(3.20) 



Since the purities Tr[^f|^J, Tr[^|^J and Tr[^|^^^ 
and one respectively, the previous inequality simplifies to: 



are equal to Tr[*^J,Tr[^|J 



Q^(^^i^2R ^ ^K, ^ ^K2^dUidU2 < 2V^ 



U(Ci) JV{C2) 



1-e 



(3.21) 



where 



2-n.(i?4+5(Ci)^-<5) 2-"(«2+5(C2)^-5) 2-"(^i+^2+5(Ci)^+5(C2)^,-25) 

" (1-e) + (1-e) +~ (1-6)2 

:= ^2-"(^i+'S'(C2)'/^-'S'(R)>A-4<5) _^ 2-"(-^2+S'(Ci)^-S(i?)^-45) _)_ 2-n(Ri+fl2-S'(R)v,-25)_ 

(3.22) 
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From the rate constraints of eq. (13.191) and the subadditivity of the von Neumann 
entropy, we can set the typicahty parameter 5 by choosing n large enough that 

Ri + ^(Cs)^ - S{R)^ - 45 > 

i?2 + S{Ci)^ - SiR)^, - 45 > (3.23) 
R1 + R2- S{R)^ -25>Q 

Hence, for these values of e and S, the bound of eq. (I3.20p vanishes for large values of 
n. By the Gentle Measurement Lemma and the triangle inequality, we have 



< 4:V3e, (3.24) 



and so, if we apply the same protocol on the state (■?^'^i'^2-R^®n^ -^g error of 

(9(-y/e) + 0(2~"^). Since e can be made arbitrarily small, there exists for n large enough, 
a family of multiparty merging protocols with arbitrarily small error and entanglement 
rate approaching Hence, the rate-tuple ^ is achievable. To recover the full 
rate region, we take the closure of the set of rate-tuples satisfying the constraints of 
eq. (I3.19P . This proves the direct part of Theorem 13.2.31 for the case of distributed 
compression involving two senders. 

The converse part of the theorem is easily established for an arbitrary number of 
senders by using the converse statement of the state merging theorem (i.e m = 1) of 
[21]. Suppose the receiver has obtained the systems T and a sender holds the entire 
remaining system T to be transferred. Then, an ebit rate of at least S{T\TB)^ is 
required to transfer T by the converse of the merging theorem of [23] . Obviously, if 
the system T is distributed across \T\ senders and only LOCC operations are allowed, 
a total rate J^i^r^^ least S{T\TB)^ is also needed. □ 

To understand why the previous approach fails to generalize to more than two 
senders, consider the Schumacher compressed state 

\^^fMBR ^n^^ ® H^^ ® . . . ® H^^ ® H^ ® Il^){\i,f-'^^r\ (3.25) 

We would like to bound the various purities and dimensions for this state by appro- 
priate "entropic" formulas as we did earlier for the case of two senders. However, it is 
most likely that the purities Tr[(f2f''")^] are not bounded by 2~'^^^^^'^'>'i'~'^^'^ for all non 
empty subsets T C {l,2,...,m}, and so, we cannot conclude that the decoupling 
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error appearing in eq. (I3.20p vanishes as n — oo. For m = 3, a state of the form 

is probably a better candidate for satisfying the typicahty bounds on the purities of 
the various reduced states of Q'^i'^a^'s. Note that 11^^ in the previous equation is a 
shorthand for /^2C3 ^ 11^^, where projector onto the 5— typical subspace 

for [ip^^)®^. We think this is an issue that will come up often when performing 
asymptotic analysis of multiparty quantum communication protocols. We make the 
following conjecture about typicality in a multiparty scenario: 

Conjecture 3.2.7 (Multiparty typicality conjecture). Consider n copies of an arbi- 
trary multipartite state 7^c'iC2---C™ . Por any fixed e > Oj^r > and n large enough, 
there exists a state vj/C'iCa-.-Cm yjj^ich satisfies 

Tr[(*'^)2] < (1 - /i(e))-'2-"(^(^)'^-^^) 

for all non empty subsets T C {1,2,..., m}. Here, i^(e) and fi{e) are functions of e 
which vanish by choosing arbitrarily small values for e. 

The conjecture is true for m = 2. Readers interested will find the proof in the 
typicality section of Appendix A. (see Proposition IA.4.51 ) 

A simple example of distributed compression for two senders 

To illustrate some of the key differences between the protocols shown to exist 
by the proof of Theorem 13.2.31 and distributed compression protocols as discussed in 
[24], let's consider again the task of distributed compression for two senders sharing 
a state ip'^^'^^R^ with purifying system R. Recall that in distributed compression, the 
receiver has no prior information about the state. For the case of two senders, the 
rate region is described by the inequalities: 

Ri > S{Ci\C2)^ 

R2 > SiC2\Ci)^ (3.26) 



71 



Chapter 3 



Let's consider a very simple state: 



where |\l/_)'^i'^2 is an EPR pair shared between the senders Ci and C2 and |\1>_)'^2-R 
is an EPR pair shared between the reference R and C2. Let's compute the entropies 
of eq. f l3.26p related to the rates -Ri and R2: 



The total entanglement cost for merging is at least one ebit per copy of the input state, 
no matter which protocol is used to perform distributed compression. For achieving 
the rates i?i = and i?2 = 1, we can use our multiparty state merging protocol 
if we inject, prior to performing measurements on the senders, an ebit per copy of 
the input state between the sender C2 and the receiver. A distributed compression 
protocol as in [23], however, will need to inject 2 ebits per copy of the input state 
between the sender C2 and the receiver if the system C2 is first transferred, followed 
by the system Ci. If the system Ci is transferred first instead, then one ebit per 
copy is needed between the sender Ci and the receiver. Thus, the distribution of 
the catalytic entanglement for the distributed compression of [21] is more restricted 
than our multiparty state merging protocol. Time-sharing can be used to achieve the 
rate-tuple (i?!, R2) with i?i = and i?2 = 1, but it may require many more copies of 
the input state to achieve this. 

Distributed compression for three senders 

Our proof of Theorem 13.2.31 is for the case of two senders. We suspect that 
Conjecture 13.2.71 will hold for the case of three senders. Under this assumption, 
we can show a real advantage to using our protocol over a distributed compression 
protocol relying on multiple applications of two-party state merging. For the case of 



S{Ci\C2)^ 
S{C2\Ci)^ 
S{CiC2)il} 







1 



1 
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three senders, the rate region is described by the inequahties: 

Rl > S{Ci\C2C3)jIj 
R2 > S{C2\CiC3)^ 

Rl + R2> S{CiC2\Cs)'^ 
-Rl + -R3 > S{CiC3\C2)^ 

R2 + R3 > S{C2C3\Ci)^ 

Rl + R2 + R3 > S{CiC2C^)^ 

Let's consider the state 

where |0)'^2'^3 := v^|00)'^2'^3 + Vl - A|ll)'^2'^3, with < A < 1, is a pure bipartite 
entangled state with entropy of entanglement: 

E{<P) = S{Cl)^ = -A log A - (1 - A) log(l - A) > 0. 

Let's compute the entropies of eq. f l3.27p related to the rates Ri and R2 as a function 
of S{Cl)^: 

S{Ci\C2C^)^ = —1 
S'(C2|CiC3)^, = — S'(C|)(^ — 1 
S{CiC2\C3)-^ = —S{C^)tf, 

The state is entangled (S'(C|)<^ > 0), hence there exists an achievable rate-tuple 
{Rl, R2, R3), by choosing R3 big enough, satisfying the inequalities of eq. f l3.27p with 
Rl < 0,R2 < and R1 + R2 < 0. Under our initial assumption, there exist multiparty 
merging protocols with arbitrarily small error achieving this rate-tuple if we inject 
(logics) ~ ni?3 ebits between the sender C3 and the receiver. The protocols will 
return around —nRi ebits between the sender Ci and the receiver and approximately 
— ni?2 ebits between the sender C2 and the receiver. No catalytic entanglement is 
required for both of these senders. 

Consider any distributed compression protocol on the other hand. If no catalytic 
entanglement is used, the protocol must first merge the system C3 to the receiver. 
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Figure 3.6: Entanglement structure of the state -^'^I'^aCsfl^ "The need for catalytic entan- 
glement using a distribution compression protocol is depicted. 



even if time-sharing is used. Assuming the receiver has recovered perfectly the system 
C3, the protocol can then merge either Ci or C2. A time-sharing strategy will choose 
to merge C2 on a subset of the input copies, while merging Ci on the other copies. 
The ebit rate must be at least S{C2\C^)^ for merging C2, and at least S{Ci\Cz)^ for 
merging Ci. We have 

S{Ci\C^)-^ = S{Ci)<s,_ = 1, 
S{C2\C3)^ = 1 — S'(C|)</,, 

and so, as long as |0)'^2'-^3 is not maximally entangled (i.e., S{C^)^ 7^ 1), both of 
these entropies are positive. Hence, transferring Ci or C2 to the receiver requires the 
injection of catalytic entanglement. 

This example is better understood by looking at the entanglement structure of 
the state ^'^I'^aCa-R ^ggg Figure 13. 7p . After merging C3, a distributed compression 
protocol will decouple either the system C2 from its relative reference R2 := CiR or 
Ci from its relative reference Ri := C2-R. There are 2 bits of mutual information 
between the system C2 and the relative reference R2. To transfer this correlation 
to the receiver using anything less than a perfectly entangled pair is impossible as 
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Figure 3.7: Using an "entanglement swapping" trick, the multiparty merging protocol 
needs no catalytic entanglement for the senders Ci and C2 to transfer the 
state to the receiver. Some of the labels were removed for clarity. 

superdense coding is optimal [5]. The same reasoning applies for the system Ci. 

To grasp why catalytic entanglement is not necessary for the senders Ci and C2, 
assuming Conjecture 13.2.71 holds, for our multiparty merging protocol, observe that 
the mutual information between the systems C1C2C3 and the reference R is entirely 
concentrated between the systems C3 and R. If we boost the number of ebits shared 
between the receiver and the sender C3, we can decouple the systems C1C2C3 from 
the reference, and generate ebits between the receiver and the systems Ci and C2 
through an "entanglement swapping" effect. This highlights a fundamental difference 
between protocols working on two parties and multipartite protocols: entanglement 
can be produced between two parties by other means than entanglement distillation 
[62 | |38 | |29] or entanglement gambling [69]. This example also exhibits a natural trade- 
off between injecting more entanglement than needed at one place and being able to 
produce entanglement or at the very least transfer other systems without requiring 
additional entanglement. 

3.3 Split transfer 

In the previous sections, we have analyzed and characterized the entanglement 
cost for merging the state %I}'^mBR ^ single receiver (Bob) in the asymptotic setting 
and in the one-shot regime. Here, we modify our initial setup by introducing a second 
decoder A (Alice), who is spatially separated from Bob and also has side information 
about the input state. That is, the senders Ci, C2, ■ ■ ■ , Cm and the two receivers Alice 
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and Bob share a global state ip'^^C2...CmABR ^-^q objective is then to redistribute the 
state ipCiC2...CmABR Aii^g Bob. The motivation for this problem comes from the 
multipartite entanglement of assistance problem [221 121], where the task is to distill 
entanglement in the form of EPR pairs from a (m + 2)-partite pure state ^^c'lCa-.-CmAB 
shared between two recipients (Alice and Bob) and m other helpers Ci, C2, . . . , Cm- 
Recall the formula for the optimal assisted EPR rate: 

^oo^^CiC....c„AB) _ rmYiS{AT)^ =: ^^i„_cut(^^^^^-^'"^^, A : 5), (3.28) 

where T C {Ci, C2, . . . , Cm} is a subset (i.e a bipartite cut) of the helpers. 

The proof that the rate given by eq. ( 13.28P is achievable using LOCC operations 
(i.e., no pre-shared entanglement allowed) consists of showing that the min-cut en- 
tanglement of the state ^c'lCa-.-CmAB jg arbitrarily well preserved after each sender has 
finished performing a random measurement on his system. The procedure described 
in the proof of [21] makes use of a multi-blocking strategy. That is, given n copies of 
the input state ^c'lCa-.-Cm^B^ ^^^^ helper will perform d = n/r random measure- 
ments, each acting on r copies of ^c'lCz-.-CmAB generating a number of possible 
outcomes. For a sequence of measurement outcomes ji, j^, • • • ,jd, we group together 
the residual states corresponding to outcome 1, then group the ones corresponding 
to outcome 2, etc... When this is done, the next helper will perform random mea- 
surements for each of these groups in the same way the first sender proceeded. That 
is, for each group, you need to divide into blocks, and so on. Needless to say, this 
approach fails if few copies are available to the parties. 

It was conjectured by Horodecki et al. [24] that these layers of blocking could 
be removed by letting all the helpers perform simultaneous measurements on their 
respective typical subspaces. Such a strategy would still produce states which preserve 
the min-cut entanglement, thereby providing a way to prove eq. f l3.28p without the 
need for a recursive argument. In this section, we show that if Conj ecture 13 . 2 . 71 is true, 
there exists an LOCC protocol acting on the pure state ^c'iCz-.-C^ab ^j^j^j^ ^ffi\\ send a 
cut Tmin which minimizes S{AT)ip to Alice and its complement to Bob. The protocol 
consists of two parts: First, all the helpers will perform random measurements on 
their typical subspaces and broadcast their outcomes to both decoders. Then, Alice 
will use the classical information coming from the helpers which are part of the cut 
Train and apply an isometry U, while Bob will apply an isometry V depending on 
the outcomes of the helpers belonging to Tmin- This will redistribute the initial state 
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To = C«C« 




LOCC 




Figure 3.8: Picture of the initial and final steps of a split-transfer protocol involving four 
senders. Jagged lines represent maximally entangled states shared between 
the receivers and the senders and solid lines represent correlation between the 
parties. The senders C3 and C4 are transferred to A, while the senders Ci and 
C2 are recovered by the receiver B. At the end of the split-transfer protocol, 
we have log(A''3) ebits shared between C3 and B for this particular example. 



to Alice and Bob. Standard distillation protocols, as described in chapter 2, on the 
recovered state will yield EPR pairs at the min-cut rate of eq. fl3.28p . 

The following definition of a split-transfer adapts the multiparty merging defini- 
tion to the case of two receivers. We follow the notational convention established in 
the previous sections for labeling the various systems, dimensions, etc. . . Figure 13.81 
helps making sense of all the labels. 

Definition 3.3.1 (Split-Transfer). Let ^t^tbr _j_ 2) -partite state, and as- 

sume the senders and the decoders share maximally entangled states ^-^'t' := ^^^^^^^ 
and r^V := (g)^^_^^^^ We call the LOCC operation M : TT^TT^ ® AA^^® BB2^ 

Aq-AA-f B^i^B Bj^ a split transfer for the state ilfATSR ^^^^^ error e and entan- 
glement costs Er{ip) := 0.g^(logi^i - log Li) and -E^(^) := 0.g:^(logMj - logA^j) 
if 



'^idR ® M){^^^™ ® ® r^^r) - ^ArAB^BR ® ® 



< e, (3.29) 



where ^^'^ := (3)jg7- '^''^S T^r := (^^g^F^^ are maximally entangled states distributed 
appropriately between the senders and the receivers. The systems Aj- and B^ are an- 
cillary systems of the same size as T and T and are held by Alice and Bob respectively. 
For the state ^ := (^^'TATBR^(^n ^ entanglement rates RrW o^f^d R^i^^) ^^"^^ defined 
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as and ^^rW- 

In the above definition, we have denoted by ^^^^(log-R'j — logLj) a vector of 
length |T| whose components are given by logi^"j — logLj for i G T. 

The rate region of a spht-transfer for the state ^'T^t^^ is defined in a manner 
analogous to definition I3.2.2I We omit the details here, but whenever we will say that 
a rate is achievable for a split-transfer of the state ip^^^^^, it means that it is strictly 
contained in the rate region (i.e., not on the boundary). 

3.3.1 Decoupling relative references 

We saw in the previous sections how the distributed compression protocol of 
achieves a multiparty merging for the state ip^f'i^^ by decoupling each sender from its 
relative reference one at a time. We use this approach here to show the existence of 
good decoders for the receivers when the senders perform simultaneous measurements. 
The main technical difficulty is to formally prove that simultaneous measurements 
by the senders still produce a state allowing a good (i.e., high fidelity) recovery of 
the initial state by the receivers. We extend Proposition 13.2.41 to our present scenario 
by following a similar route to that of section 13.2.21 We begin by analyzing an ideal 
situation. 

For a pure state ip'T^T^^^ suppose each sender performs an incomplete measure- 
ment on their respective shares of the state. For a measurement outcome Jm '■ = 
(ji, J2, ■■■Jm), define the state 



^ (3.30) 



=: -^{Plr ®Pl® /^^^)|V^^^™), 
\JVJm ^ 

where the Kraus operators Pj^ = 'Stie^T^ji -^j^ ~ ^i&T^ji spaces T 

and T to the subspaces and T^. Related to this state is the Rj-- relative state 



/PjT 

where Rj- := TBR and pj^ is the probability of getting the outcome jf. If each 
sender in T perfectly decouples his system from the relative reference Rj- and the 
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other senders in T, we have 



^T^^Rt = ^T- ^Rr ^ (3.31) 

where r^^ = (SJier^*"'^ maximally mixed state of dimension Lj- := Yiier^c^ 

on the system T^. From the Schmidt decomposition, there exists an isometry Uj^ : 
A Alj-Aq-A which Alice can implement such that 

(/■^'^^ ® Uf^)]^]"/""^) = 1$^^) ® IV^^^^^^), (3.32) 

where the state \%1}^tARt\i jg same as the original state ^ijjT^TBR'j with the ancillary 
system Aj- substituted for T. 

After merging the systems T to the receiver A, the senders T follow with their 
measurements. Define the RY-relative state 



where Ry '■= Aq-AR. Assume once again that each sender perfectly decouples his 
system from the relative reference and the other senders in T. We have 



ryr = ^ ^Rr^ (3 33) 



where r^^ = (^jg7=T'"» is the maximally mixed state of dimension Nj- := YlieT^c^ 
on the system T^. From the Schmidt decomposition, there exists an isometry Vj^ : 
B —7- B^ByB implementable by Bob such that 



J J- jj- 

where the state I^^^t-^-B^B-R) jg ^-j-^g same as the original state ip^^TBR ^^m^ i\^q ancil- 
lary systems Aj- and B^ substituted for T and T. 

If the senders perform their measurements simultaneously instead, the initial 
state is recovered by applying the isometries and V^^ to the the outcome state 
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1 



V P^A/ 

(3.35) 

Since the states {r'^R ® Uf^ ® V^-p l^/'J^'/'^^-^) and |$^^) ® |r^-) ® |^^r^-B-i?R^ 
both normahzed, we have pj^^ = Pj^pj—. Hence, for this ideal scenario, the decodings 
implemented by the receivers for a split-transfer protocol which decouples the senders 
from their relative references one at a time can also be used to recover the initial state 
if the measurements performed by the senders are done at the same time. This agrees 
with our intuition that the choice of the decoder applied by the receiver A (resp. B) 
should not depend on the measurement outcomes of the systems T (resp. T). We 
resume the previous ideas in the following adaptation of Proposition 13.2.41 

Proposition 3.3.2 (Conditions for a Split-Transfer). Let ip'T^'T^^ he a multipartite 
state shared between m senders and two receivers. Suppose the senders simultaneously 
perform incomplete measurements on their systems, yielding a state {tpj^"^'^^^^) := 



:(P,y (8) PJL ® I^BR^^^TATBR^ OUtcomC J, 



M- 



Using the notation of the previous paragraphs, define the decoupling errors Q^ijp'^^^'^') 
and Qli^lF^^T): 



^ — (3-36) 
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If Qi ^ ^ ^''^d Qx < e', then there exists a split-transfer for the state ip'T^'T^^ with er- 
2-ye + 2-v/e' and entanglement costs Ej- = 0.g^(— logLj), i?:^ = 0^g:^(— log A'j). 



ror 



Proof Apply Proposition I3.2.41 for each of the decoupling errors and Qj. Since 



these are bounded by e and e' there exist isometries Uf and VI such that 



ir 



< 2v^ (3.37) 

1 

< 2v^. (3.38) 



If we apply the isometries f/j^ and V^^ to the outcome state I'ipJ^/^'^^^^) , we have 



- (/ ® \/p(Pf ® /)(/ ® t//,)^r/^M/ ® ® ® 

=:-M(C) 



where C := ^j^Pjri^ ® ^jr^^Jr^^''' ® ^tr^'^ ' '"^^ ^^^'^ output state we 

would get if only the senders in T wanted to transfer their systems to the receiver A. 
The map M., as defined above, is a trace-preserving LOCC operation (i.e., measure- 
ments by the senders in T followed by an isometry on B). Note that we removed 
some of the superscript notation for the sake of clarity. 

To bound the trace distance between the output state p and the state ilj^T^BrfSR^ 

(g) r^r , we introduce the following intermediate state 



It 

j^^^ArATBR 



.«)t 



(3.39) 
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and apply the triangle inequality 





< 


p-a 


+ 




1 




1 



a - ^ArAB^BR ^ ^Lr ^ 



From eq. fl3.38p . the trace norm 



a- - ^ArAB^BR ^ ^Lr ^ pTV- 



above by 2\/e'. To bound \\p — a\\i, we have 



(3.40) 

is bounded from 



\p-(^\\i 



< 



^ _ ^ArATBR ^ ^Lr 



(3.41) 



< 2v^. 



The first inequality holds since the trace distance is non-increasing under quantum 
operations, and the second inequality is just eq. (I3.37p . Thus, we have a split-transfer 
for the state ipCiC2...c^ABR ^^^^i error 2^/e + 2^^. □ 

3.3.2 Split-transfer by random measurements 

With this result in hand, a one-shot split-transfer protocol for the state ipT^TBR 
is obtained by two independent applications of Proposition I3.2.5[ followed by an 
application of Proposition 13.3.21 We state the result here. 

Proposition 3.3.3 (One-Shot Split-Transfer). Let ipT^TBR ^ multipartite state 
for m senders and two receivers. For each sender Ci in the cut T, there exists an 
instrument Xi = {£^j}jlQ consisting of Fi = [ '^^j^^' J partial isometrics of rank Li and 
one of rank L[ = dc.Ki — FiLi < Li such that the decoupling error Q^-^ip'^^^''' ® $^''") 
is bounded by 



TARt 



$^r) < 2 5^ n 



RrX 



(3.42) 



Similarly, for each sender Ci in the cut T, there exists an instrument Xj = {£^*}^q 



consisting ofCi = [-^i^J partial isometrics of rank Ni and one of rank N- = ddMi — 
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GiNi < Ni such that the decoupling error Q'^i^if^^T (g) T^^t) is hounded by 



«r 11 Mi 



-^l-y 



-■■ A!. 



(3.43) 



Finally, there exists a split-transfer for the state ^'T^'^^^ with error 2 a/ Aj + 2 1 
r/ie /e/( /ianc? szfies o/ egs. Iji3.4^ and Iji3.43\ ) are bounded from above on average by 



their right hand sides if we perform random instruments on all the senders using the 
Haar measure. 

Proof The bounds on the decouphng errors Q\ and are obtained by two inde- 
pendent apphcations of Proposition I3.2.5I We leave the details to the reader. The 
existence of a split-transfer with error 2 A ^ + 2 a/ A|- follows from Proposition 13.3.21 
Note here that since the senders have additional entanglement at their disposal, the 
partial isometries Pj^ and in Proposition 13.3.21 are replaced by -P,y^° and PjlJ^ ■ 



These will act on the spaces TT and TT° respectively, with output spaces corre- 
sponding to T^ and T^. □ 

3.3.3 Asymptotic analysis 

The asymptotic analysis for the split-transfer problem is done using the approach 
of Section r3.2.4l by treating each decoupling error separately. We arrive at a variation 
on Theorem 13.2.31 

Theorem 3.3.4. Let ipT^T^^ be a multipartite state shared between m senders and 
two receivers. The rates Rri'ip) '■= ©jg7-(-Ri) o-'>T'd RfW '■= ©ier(-^«) ^'^^ 
region for a split-transfer of the state ^t^'T^^ iff the inequalities 



(3.44) 
(3.45) 



hold for all non-empty subsets X (IT and y (^ T. 



Proof By Proposition I3.3.2[ the task of split-transfer is equivalent to performing 
simultaneously two multiparty state merging protocols, each one involving different 



83 



Chapter 3 



parties. If we view the state ipT'^T'^^ as a multipartite state consisting of senders Ci 
belonging to the set T, a receiver A and a reference RBT, we can apply Theorem 
l3.2.3l to show the existence of multiparty merging protocols with arbitrarily small error 
and an entanglement rate satisfying the inequalities of eq. fl3.44p . Viewing instead the 
state qp'T^'T^^ as a multipartite state consisting of senders Ci belonging to the set T, a 
receiver B and a reference RAT., Theorem 13.2.31 tells us of the existence of multiparty 
merging protocols with arbitrarily small error and an entanglement rate satisfying 
eq. fl3.44p . For corner points, we can applying these protocols simultaneously, and we 
have, by Proposition 13. 3. 2^ a family of split-transfer protocols with vanishing error as 
n grows larger. The other rates of the regions described by eqs. fl3.44p and fl3.45p are 
achieved by using a time-sharing strategy. This proves the direct part. 

To prove the converse, consider any cut X of the senders in T and look at the 
preservation of the entanglement across the cut AX vs XTBR. Assume, for technical 
reasons, that Lj < 2'^^'"^ for all i E 1~. The initial entropy of entanglement across the 
cut AX vs XTBR is 

:= nS{AX)^ + log K^. (3.46) 

At the end of any LOCC operation on the state (^ip'T'ATBR^®n ^ ^-^q output state is 
an ensemble {qk.il^^ , , — i , I of pure states. Using monotonicity of the 

entropy of entanglement under LOCC [38], we have 

nS{AX)^ + J]logJ^i >Y,(lkS(x'A\A^A^)^,, (3.47) 

i&X k 

where X^ := <S>i(zxCi- split-transfer of the state [^TATBR-^^^u ^\^\^ error e, 

we have 

E '?^^^'(<MVA^A^r^B^B^..z.^- «ri.B^H ® ® r^^) > (1 - e/2)l (3.48) 

k I I T T 

This follows from the definition of a split-transfer (eq. fl3.29p ) and the fact that 
is linear when one argument is pure. Using the relation between trace distance and 
fidelity (eq. f l2.12p ). and the convexity of the function, we rewrite this as 



Qk 



^r^A^^A"Ar^T'B^B"B-R- ^AArBB-R ® "^^^ ® 



< 2Ve(l -e/4). (3.49) 
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By monotonicity of the trace norm under partial tracing, we get 



k 



< 2^/e{l-e/A). (3.50) 



Using the Fannes inequahty (Lemma I2.2.3P and the concavity of the ?7-function, we 
have 



J^Qk 5(A''a^AM^)^. - ^logL, - nS{AXX\ 

< (2 J2 log Li + n log dA + n log rfAr)^(2 Ve(l - e/4)) 



< 0{n)ri{2^/e{l-e/A)). 
Finally, using eq. fl3.47p . we have 

J2^^ = Y1 -(log^^. - log^.) > SiX\XA)^, - 0(l)r/(2v/e(l-e/4)) (3.51) 



n 



for any non empty subset A" C 7". Using a similar argumentation, we can show that 
J2^^ = Y1 -(log^* - log^^) > S{y\yB)^ - 0(l)r/(2v/e(l - e/4)) (3.52) 

holds for any non empty subset y ^ T. By letting n — )■ oo and e — 0, we get the 
converse. □ 

3.3.4 An application: entanglement of assistance 

We are now ready to give a protocol for distilling entanglement at the min-cut 
rate Emin-cutii^'"^'"^'"'"'"'^^) ■ If the multiparty typicality conjecture holds, which we 
recall is a statement on the extension of well-known typicality properties to a mul- 
tiparty scenario, we can answer the conjecture of [23]: With high probability, the 
min-cut entanglement Emin-cuti'ip'"^'^''^'"'^'™'^^) of the state ^c'lCa-.-Cm^B arbitrarily 
well-preserved after the helpers Ci,C2, ■ ■ ■ ,Cm perform simultaneous random mea- 
surements on their typical subspaces. 

Theorem 3.3.5. Let ^c:iC2-CmAB ^ multipartite pure state with m helpers and 
two receivers. Given arbitrarily many copies ^C'lCa-.-CmAS^ there exists an LOCC 
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protocol achieving the optimal "assisted" EPR rate: 

E^{4j, A:B) = mm{S{Ar)^} (3.53) 

Proof Denote by Tmm ^ {1? 2, ... , m} the min-cut of the smallest possible size such 
that: 

Vr C {Ci, ^2, . . . , Cm] ■■ S{ATrn,n)^ < SiAT)^. (3.54) 
When Train is not the empty set, we have, for any non-empty subset X C Tmin- 

S{X\XA)^ = S{%ninA)^ — S{XTmmB)i(, 
< S{XA),^ — S{XTm\nB)^ 

_ _ (3.55) 

= S{XTnimB)^ — S{XTniinB)^ 

= 0, 

where in the second line we have used the fact that S{AT^ia)i(> < S{AT) when T is 
a smaller cut than T^in. If TJnm 7^ {1)2,. we have, for any non-empty subset 

y C Train. 

Siy\yB).^ = SCTnrinB)^, - SiyTrninA)^ 

= S{TnnnA)^ - S{yTnnnA)^ (3.56) 

< 0. 

Since S'(A'|A'y4) is negative for all non empty subsets X C Tmin^, there are negative 
rates (i.e Rr^^in = ©ie7;,,i„(-R«) where -Rj < for all i G T^nin) achievable for the 
helpers in Trnnr by Theorem 13.3.41 Hence, by Conjecture I3.2.7[ there is no need for 
additional injection of entanglement for these helpers. To distill entanglement, we 
can set their random instruments with projectors of rank L, = [2~"^'J^. If the 
conditional entropies S{y\yB)^ are negative for all non empty subsets y C Tann^ we 
proceed similarly for the helpers in Tmin- Then, the average decoupling errors Q\ and 
Q\ will vanish as n grows larger, and this without injecting additional entanglement. 



^Thc previous protocol of [24] for the multipartite entanglement of assistance restricted the 
helpers to measurements with rank one projectors. 
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Using Markov's inequality, we have, for any ei, €2 > 0: 



where the averages are respectively taken over the unitary groups U(C*j),z G T^in 

and U(C*j),z G Tmin- The states (^^^'"^'^""" and t^j^'" are the outcome states 
following instruments performed by the helpers. Since the decoupling errors can 
be made arbitrarily small as n grows larger, these probabilities will also vanish on 
average. Hence, there exist instruments performed by the senders which allow, with 
arbitrarily high probability, the redistribution of the original state to the two receivers, 
with TJnin going to A and its complement to B. This in turn implies that the min-cut 
entanglement must be arbitrarily well-preserved when the senders perform random 
instruments on their typical subspaces. 

When some of the conditional entropies S{y\yS)^ are zero, there is no split- 
transfer protocol achieving negative rates for all the helpers in Tmin- We leave the 
conjecture open for these cases. However, it is still possible to redistribute the original 
state and preserve the min-cut entanglement by injecting an arbitrarily small number 
of singlets between the cut y vs By . Another alternative is to produce entanglement 
using a subset of the initial number of copies available to the parties. The procedure 
works as follows: First, the helpers in Tmin transfer their systems to the receiver A 
using a multiparty merging protocol. As the conditional entropies S{X\XA)^ are 
negative for the minimum cut, this can be achieved using only LOCC operations. 
The receiver A then performs a measurement on AA-j-^.^ of the kind described in the 
entanglement of assistance protocol of [23] (see Chapter 2 also). The helper is AAj-^^.^ 
for our setting and the recipients of the entanglement are y and By. Since y and By 
are not individual parties, the decoding part of the protocol cannot be implemented. 
The amount of entropy of entanglement for an outcome state tpj, however, should be 
arbitrarily close to 

E{tlj,)^mm{S{y)^,S{By)^} 

by the formula of eq. f l2.32p . If the minimum is zero, the systems in y are in a known 
pure state ip'^ (i.e S(Y)^ = 0) as, by hypothesis, the conditional entropy S{y\yB)^ 
is zero. The receiver can then locally prepare a system Ay in the state tp-^ . 
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If the entropy of entanglement is positive for the outcome state, the helpers part of 
Tmin can exploit this entanglement to merge their state by LOCC. An entangled state 
shared between y and By , with entropy of entanglement E{'il)) := A, contributes —A 
to S{y\yB)^, making negative rates achievable for the helpers in Tmin- Thus, the 
overall protocol still uses only LOCC operations and produces a state ip 
such that 



' mm — yip ' mm 1 



< 



A^B^, B" ( .Ar . AB^ By. 

'mill ' min — i'lp 'mm ' min J 



< e, (3.57) 



where ijj^'^'^i'^^^T'niin^ is the original state ipTrainATminB ^j^j^ systems Aj-^^^ and 
-^Tmiii substituted for the systems Tmin and Tmin- Applying the Fannes inequality, we 
have 



^ ' mm 

which implies that 



nSiAr . A'^ 

V ' mm 



< n\og{dAr ■ dA)r]{e) 



(3.58) 



S{A}-_A-)^ = n(^(A7;„,. A)^ ±S) = n(^(ATmin)^ ± 5), 



(3.59) 



where S can be made arbitrarily small by letting e — !■ 0. Alice and Bob can distill 
arbitrarily close to the min-cut rate by applying an entanglement distillation protocol, 

r^^T^ B— B^ 

as in [22], on the state Lf ''"min . □ 
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Entanglement Cost of Multiparty State 

Transfer 



Most information processing tasks are analyzed under the assumptions of indepen- 
dence and repeatability. For instance, the noiseless channel coding theorem of Shan- 
non starts from the premise that a source emits an infinite sequence of i.i.d. random 
variables according to a known distribution p{x). In the quantum regime, Schu- 
macher compression assumes that the quantum source is producing many identical 
copies of a state p. For such scenarios, the relevant measure of information is the von 
Neumann entropy. If we drop either of these two assumptions, the well-known formu- 
las for operational quantities such as the minimum compression length of a message 
or the capacity of a noisy channel are not applicable anymore, and other measures 
of information such as the spectral entropy rates of [70] and the smooth min- and 
max-entropies of Renner [59] become more appropriate. 

When a single copy of a state ip^^^ is available, it was found in [27] that the 
smooth max-entropy is the information theoretic measure which characterizes the en- 
tanglement cost associated with the task of state merging. The minimal entanglement 
cost log Ki — log Li is bounded from below in the one-shot regime by 



where Hl^^y-{A\B)^, is an alternative version of the smooth max-entropy, which opti- 
mizes the max-entropy over all density operators ij)^^ close in the trace distance (i.e 
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D{ip^^ ,ip^^) < e). Whenever the entanglement cost""^ satisfies 

logi^'i - logLi > Ht^{A\B)^ + 41og Q + 12, (4.2) 

there exists a state merging protocol for the state ip"^^^ with error e. This result was 
derived by re-expressing the upper bound of Proposition 13.2.51 (when m = 1) as a 
function of the smooth min-entropy. 

In this chapter, we extend some of the results of Berta [27] to the multiparty set- 
ting, where m senders and a receiver B share a state ifj^^'^^^ with purifying system R. 
We give a partial description of the entanglement cost region for multiparty merging 
of the state i^'^mBR^ q^^, main contribution is to characterize a subset of the entan- 
glement cost region where multiparty merging for the state iP^mBB. achievable with 
error e. For any entanglement cost tuple (log i^i — log Li, log A''2 — log L2, . . . , log Km — 
\ogLm) satisfying 

log Ar - logLr := J] log > -iJ^i,(^/'^^|^/'«) + 41og + 2m + 8, (4.3) 

for all subsets T C {1,2,..., m}, we show the existence of multiparty merging pro- 
tocols with error e, where the senders measure their systems simultaneously and the 
decoding is not restricted to a composition form Uj^Uj^_^ . . .Ui. We can also show 
a similar result when we have m senders and two receivers (split-transfer). 

We also consider a different protocol where each sender merges his system one 
at a time as in the distributed compression protocol of [23]. The advantage in this 
approach is that it allows us to use the better entanglement costs proven in Dupuis 
et al. [28] for one-shot state merging. We can achieve multiparty merging of the state 
^CmBR error e whenever the entanglement costs satisfy 

log f j > -H^ {C,\R^-.^))^ + 41og f — j + 21og(13) for all 1 < ^ < m, 

(4.4) 

where -R^-i(j) is the relative reference for the sender Ci with respect to an ordering 
TT : {1, 2, . . . , m} of the senders. 

The last part of this chapter is devoted to examples for one-shot distributed com- 



^The numbers Ki,Li are natural numbers, and so we must choose values for Ki and Li such 
that logKi — logLi is minimal, but greater or equal than the right hand side of eq. (|4.2p . 
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pression. We compute bounds on the entanglement cost for our two protocols and 
discuss some of the shortcomings to using one-shot two-party state merging protocols 
for achieving the task of multiparty state merging. The last example considers a 
family of states for which smoothing has little effect on the min-entropies appearing 
in eq. f l4.4p . The entanglement contained in such states does not reduce to bipar- 
tite entanglement between some of the subsystems, making it harder to analyze the 
entanglement costs of merging. We can nonetheless get interesting bounds on the 
entanglement cost by using the Gershgorin Circle Theorem [711 [72], a standard result 
in matrix theory. 

4.1 Achieving one-shot multiparty state transfer 



Our first result is a reformulation of Lemma 13.2.61 in Chapter 3 as a function of 
min-entropies: 

Lemma 4.1.1 (Compare to Lemma 4.5 of [27]). For each sender Ci, let Pi : Ci — t- C} 

he a projector of dimension Li onto a suhspace C} of Ci and Ui a unitary acting on 
Ci. Define the sub-normalized state 



R\j,CmBR( 



IfUi,U2,..., Um are Haar distributed unitaries, then for any state of the system R, 
we have 



V[Cm) 



d, 



■Cm 



dUM < 



d, 



Cm 



\ rc{l,2,...,m} 

(4.5) 



Proof By Lemma I2.3.2[ we have, for any state of the reference R, 



d, 



''Cm 



M 



(/^M®(T^^)(u;^M«(f/^,,) 



i\(,.,Cl,Ri 



''Cm 



r^M (g) tpR')(I^M (g) cr^-*) 



2 

(4.6) 
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Define the states 



and rewrite eq. fl4.6p as 



■Cm 



< A 




1 





''Cm 



Using eq. fl3.13p in the proof of Lemma I3.2.6[ we have 

L 



U(Cm) 



(^Cm 



R 



2 rc{l,2,...,m} i^r iGT 



^TR 



rc{l,2,...,m} 

r^0 



(4.7) 



The quantity Tr[?/'^^] is rewritten as: 



Tr [■?/;. 



tr] 



Tr 



Tr 



CmR] 



2n 



TR/tT 



(4.8) 



where H2{ip^^\<J^) is the conditional collision entropy of ip'^^ relative to a^. Com- 
bining eqs. dUD, (iZD, dUD and using the fact that H.^i^i^p'^^la^) < H2iip'^^\(J^) 
(Lemma 12.3. ip and the square root function is concave, we have 



V{Cm) 



uj^IiR{Um) - ^^r^M (g) 



d, 



Cm 



dUM < 



< 



L 



M 



dcM 
Lm 



\ rc{l,2,...,m} 



d, 



Cm 



^ 2-{Hu.iM'^^\'yii)-iogLr) ^ 

1,2,... 

(4.9) 



\ rc{l,2,...,m} 
\ 7 
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and so we are done. 



□ 



Using this result, we can rephrase Proposition 13.2.51 in the language of min- 
entropies. This result extends Lemma 4.6 in [27] to the multiparty setting. 



Theorem 4.1.2 (Compare to Lemma 4.6 of [27]). Let ip^^BR ^ multipartite state 
shared by m senders and a receiver, with purifying system R. If, for any e > 0, the 
entanglement cost ^ = (logi^i — log Li, log K2 — \ogL2, . . . , logi^m — log Lm) satisfies 

log J^r - logLr := l^log > -i/^in(^^^|^^) + 41og + 2m + 8 (4.IO) 

for all non-empty subsets T C {1,2,..., m}, there exists a state merging protocol for 
the state ip'-'^BB. yjnf^ ^fror e. 

Proof We fix a random instrument for each sender Cj as in Proposition 13.2.51 each 
sender Cj has an instrument with Ni := [ '^^j^^' J partial isometrics P/ = QjUi, where 
Ui is a Haar distributed unitary acting on the system CiCf. Recall that Qj is a 
partial isometry mapping to a subspace of CjCf . If ddKi > NiLi, there is an 
extra partial isometry P° of rank L'. := dc^Ki — N^Li < L^. Applying Lemma [4.1.11 
for the state ip^'^'^ (g) r'^", with = ip^, and using additivity of the min-entropy 
(Lemma I2.3.3p . we have 



C\jR 
Jm 



■Cm 



dU, 



M 



< 



2-(^min(^'^«l^«)+log/Cr-logir) 

\ rc{l,2,...,m.} 



< 



\ rc{i,2,...,m} 



(4.11) 



Using the constraint of eq. fl4.10p for the entanglement cost logf^T — logLT-, we 
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simplify the previous inequality to 



„ Afl N2 Nm 

/ EE---E 





< 






rc{l,2,...,m} 










< 


^« < 


2 


io 



M 



(4.12) 



Taking normalisation into account, with Pjj^j{Um) = Tr(u;j^'^^^ (Um)) and ■?/; 
"^^^ }j ^[^ , we trace out the left hand side of eq. (14. 121) and obtain 



Ni N2 



N„ 



PJm{Um) 



L 



M 



''Cm 



dUM < 



16' 



Applying the triangle inequality, we have 



M, 



dUM < -. 



Combining this with eq. f l3.15p . found in the proof of Proposition 13.2.51 we have 
an upper bound to the decoupling error Qx{ip^^'^^ ® <|)^a/~) g^g function of the 



94 



Chapter 4 



parameter e: 



U(Ca/) jj=0i2=0 jm=0 



M, 



R 



dU 



M 



<2 V 



M 



r^0 



< 



rc{l,2,...,m} 



< 



E 



2g42^^min{'/'^) g2 



rc{l,2,...,m.} 



< 



2^+7 8-4 

The fourth hne holds by the strong subadditivity of the min-entropy 



and the last line holds since 



^^min(^ ) = -logAmax(^ ) < log^Cr- 

By Proposition 13.2.41 there exists a state merging protocol for the state iP'^mBR ^\^\^ 
error 2\Je^ /A = e, and so we are done. □ 



Lemma 4.6 of Berta [27] suggest an improvement of our previous Theorem: re- 
placing the min-entropies H^i^{ip'^^\ip^) appearing in eq. fl4.10l) by their conditional 
versions H^i^{T\R)tp. The theorem would remain valid for these weaker constraints on 
the entanglement cost provided we can prove a more general version of Lemma 14.1.11 



V{Cm) 



"Cm 



cIUm < 



\ rc{l,2,...,m} 
\ 7 



2-iHmin{i'T''W^)-logLr)^ 

1,2,... 

(4.13) 

where cx^ are 2™ — 1 possibly different density operators. Assuming this inequality to 
be true, we can adapt the previous proof by setting := (T^, where Hmi^{ip^^\aj^) = 
H^in{r\RU, and logi^r - log^r > -H^^{r\R)^ + 41og {\) + 2m + 8. 
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The results of Berta also suggest another improvement of our Theorem 14.1.21 
smoothing the min-entropies H^in(T\R)^ around sub-normalized density operators 
ip^^ close in distance to the state ip'^^'. To satisfy these looser requirements on the 
entanglement cost, we would need to adjust the proof of Theorem 14. 1.21 by updating 
eq. f l4.13p to an even stronger version, where 



U(Cm) 



d-CM 



dUM < 



L 



M 



dcM 



^ 2-(^min{T|i?)v.-logir)^ 



rc{l,2,...,m.} 



(4.14) 

and this inequality holds for any fixed e > 0. Presently, it is unclear if these improve- 
ments of Lemma 14.1.11 hold. We leave it as an interesting open problem: 

Conjecture 4.1.3. Let ip'^^BR ^ multipartite state shared between m senders and 
a receiver B, with purifying system R. For any e > 0, there exist multiparty state 
merging protocols for the state i^^mBR yjj^if^ error e whenever the entanglement cost 
^ := (log Ki - log Li, log K2 - log L2, . . . , log Km - log L„) satisfies 

logKr-logLr := ^^(log A^-logL,) > H^^,^{r B \ B)^, + (log l/e) + 0{m) (4.15) 

ier 

for all non-empty subsets T C {1,2,..., m}. 

The main difficulty in proving the conjecture is that it allows independent smooth- 
ing of each of the min-entropies. It is straightforward to modify our proof to allow 
smoothing using a common state for all the min-entropies, but the monolithic nature 
of the protocol does not naturally permit tailoring the smoothing state term-by-term. 
We can, however, give a partial characterization of the entanglement cost in terms 
of smooth min-entropies if we apply the single-shot state merging protocol of Berta 
[27] on one sender at a time. We will actually use a more recent result by Dupuis 
et al. [28], which characterizes the entanglement cost of merging using the current 
definition of the smooth max-entropies in terms of the purified distance. 

Proposition 4.1.4. For a multipartite state ip'^^BR gf^Q^^f]^ between m senders and 
a receiver B, let it : {1, 2, . . . , m} — )■ {1,2,..., m} be any ordering of the m senders 
Ci, C2, . . . , Cm- For any entanglement cost ^ = (log.ft'i— logLi, . . . , logi^m — log-^^m) 
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satisfying 



log I I 



2m 



> -H^-' + 41og — + 21og(13) for all 1 < t < m 



(4.16) 

where Ri := R<S>f=i+i^Trij) ^■^ relative reference for the sender C.„(^i), there exists 
a multiparty state merging protocol for the state tP'^mBB. ^^^^ error e. 

Proof Our multiparty state merging protocol for the state iP'^mBR consists of 
transferring the sender's systems one at a time according to the ordering vr: The 
sender (7,^(1) merges his part of the state first, followed by C^(2), C*^(3), etc. Write 
the input state ip'^^BR ^c^(i)i?iB^ where -Ri = -RC^(2)C'7r(3) • • • is the relative 

reference for the sender 6*7^(1). By Theorem 5.2 of |28], there exists a state merging 
protocol of error e/m and entanglement cost^ 



— +2 log 13 
e / (4.17) 



\ogK[ - \ogL[ := -ff^(a(i)|/2i)^ + 41og 

< logfr^(i) - logL^(i), 

B BR 

producing an output state p '^t^' '"(^^ "'^^ ^ satisfying 



Pi 



BTt(\)BR\ 



e 

< -, 
m 



(4.18) 



where the system Bt,{i) is substituted for the system Ct,{i). 

After C7r(i) has merged his share, the next sender C,r(2) performs a random instru- 
ment on his systems and send the measurement outcome to the receiver. Assume the 
parties share the state ip^^w^^^ (g)<|)'^i instead of the output state pi. Write the state 
^B^a.BR, ^c.(2)B2R2^ ^ith B2 := B^f^i^B and R2 := RC^(^3)C^^4) ■ ■ ■ C^{m). Using 
Theorem 5.2 of Dupuis et al. [28] for the second time, there exists a state merging 
protocol of error e/m and entanglement cost 



'2m\ 

41og(— +21og(13) 

e / (4.19) 



logi^^ - logL'2 = -i/^(a(2)|i?2) 



< l0gA'^(2) - logL^(2), 



^The merging error in |28j is defined in terms of tlie purified distance, which is lower bounded 
by the trace distance. We adjusted the entanglement cost proved in [28] to meet our definition of 
merging, which is expressed in terms of the trace norm. 
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El 



-Hi;Z(.Ci\C,I)U + 0{log{i)). 



-ffS(Ci|fl)s, + 0(log(l)). 




■+- 



+ 



-i/™(C2|ii),. + 0(log(l)) -//,5(C2|CiJf)e + O(log(l) 



E2 



Figure 4.1: Entanglement cost region prescribed by Proposition 14.1.^ for multiparty state 
merging for the case of two senders. The axes correspond to the entanglement 
cost El := logKi — logLi and E2 := log 7(^2 — log -^2- We have two possible 
orderings for the senders, and according to Proposition 14. 1 .41 two intersecting 
regions where existence of a 2-party state merging of error e can be shown. 
The region covered with circles (resp. crosses) is the entanglement cost region 
associated with merging the sender Ci (resp. C2) first. 



producing the output state " , 



which satisfies 



P2 



e 

< -, 
m 



(4.20) 



where the system Bt,(^2) is substituted for the system C^2)- If we apply the same 
protocol on the state " , we have an output state ps satisfying 



< \\P3 - P2II1 + ||P2 - ^ ^ $L2||^ 

< IIpi - ^jj^-W^^'^ (g) + ||p2 - ^B-WB-(2)BR2 ^ ^ $L2||^ 

2e 



< 



(4.21) 



m 



The second line is an application of the triangle inequality and the third line was 
obtained using monotonicity of the trace distance under quantum operations. The 
analysis for the other senders 6*7^(3), C^(4), C^(m) is performed similarly, and so. 
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the final output state pm satisfies 



< e. (4.22) 



1 



Hence, there exists a multiparty state merging protocol of error e for the state ip'^^BR^ 
with entanglement cost E := (logKi — logLi, log-ft'2 — log-^v2, • • • , logK^ — logLm) 
satisfying eq. f l4.16p . □ 

Figure 14.11 depicts the entanglement cost regions prescribed by Proposition 14.1.41 
for multiparty state merging for the case of two senders. Note that the hatched area 
is not part of the cost region characterized by eq. (14.101) . 

If only a single copy of ip'T^'T^^ is available to the involved parties, we can adapt 
the argument of Theorem 14 . 1 . 2 1 and prove the following result concerning the existence 
of split-transfer protocols with error e: 

Proposition 4.1.5. Given a partition T C {1,2,..., m} of the senders Ci, C2, ■ ■ ■ , Cm, 

let i^p'^'T^^ he a multipartite state shared between m senders and two receivers A 
and B, with purifying system R. For any ei,e2 > 0, if the entanglement costs 
= ®ier(^ogKi - log Li) and = 0^g7=(logMi - log A^j) satisfy 

J2i^ogK, - logL,) > -H^Ui^''''r\^RT^ + 4iog + 2\r\ + 8 

J](logM, - logiV,) > -H^Ui^y^'rl^pR-) + 41og + 2\r\ + 8 

iey ^^^^ 

for all non-empty subsets X C T and y , there exists a split-transfer protocol for 
the state ijp'^'TBR yjj^^j^ error ei + €2- 

Proof The proof is very similar to the proof of Theorem 14.1.21 First, we fix random 
instruments for each helper Cj in a manner analogous to Proposition 13.2.51 For each 
helper Ci in T, we have Fi = [ '^^l^' \ partial isometries Q{Ui of rank Li, where Qj is 
defined as in Proposition 13.2.51 and Ui is a Haar distributed unitary acting on CjC°. 
If FiLi < dc^Ki, we also have a partial isometry of rank L'i < Li. Similarly, for each 
helper Ci in T, we have Ci = [-^^r^J partial isometries Q{Ui of rank Ni, and one 
of rank A^^' if CiNi < dc^Mi. For a measurement outcome Jm '■= {ji,j2, ■ ■ ■ ,jm), let 
Jj- = 0jg7- ji be the vector of length t = \T\ whose components correspond to the 
measurement outcomes for the senders belonging to the cut T. The i-th element of 
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J J- will be denoted by ■ Define 

^Vr""^ ■■= (QtUt ® /R^)V'^^^(Q]'f/r ® WY, (4.23) 
where Q^^ := ^i^-j-Qi' ■ ^PP^Y^^S Lemma [4. 1.11 to the state ifp'^''' ® t^''' , we have 



Fi F2 



Ft 



V(Cr) 



ir(i)-i ir(2)-i ir(t) 



riier-^i-^' 



r 



< 



< 



dcr^T 



\ 



,(V.^«r |V>«r ) + log X;t-log L;t.) 



E 2^'"" 



i„ {.pXRr I ^flr )+iog Kx-\ogLx) 



(4.24) 



where Kx '■= Y\iex ^Y hypothesis, we have 

|^(logi^, - log Li) > -i/^i,(V;'^^^|V;^^) + 41og (^i^ + 2\r\ + 8 



for all non empty subsets X <Z T. By the proof of Theorem I4.1.2[ we have the 
following bound on the average quantum error Qj^tp^^^''' ^^'t): 



Fi F2 



f E E •■■ E p^r 



dU- 



T 



< 



< 



2En 

XC^Ti^X 
X^% 



Fi F2 Ft 
^' ir(i)=iir(2)=i ir(t)=i "^'^^^ 



dU' 



T 



E 



f2 ,2 

- 2*+7 ^ 8 - 4 ■ 



where t = |T|,pj^ = Tr(a;J^'^''') and ipj^^'^ = ■^-Uj^^'^. In a similar way, we bound 



T^Rt — 1 , J"^Rt 
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the average quantum error Q'^i^ip^^^r (g) T^^t) as follows: 

Gi G2 



Gm — t 

E E ••■ E 

^^^^^ ir(i)=OM2)=o Jr(,n-o=o 
< > — h — 

- 22(™-*)+8dcv 8 



da 



T 



< ^ 1- ^ < ^. 

- 2m~t+7 8 ~ 4 

By Proposition I3.3.2[ there exists a split-transfer protocol of error ei +62, and so we 
are done. □ 



4.2 One-shot distributed compression 

Both Theorem 14.1.21 and Proposition 14.1.41 describe entanglement cost regions 
where multiparty merging is achievable for any fixed e > 0. The proof of Theorem 
14.1.21 is significantly more complicated than that of Proposition 14.1.41 To illustrate 
the benefits accruing from the additional effort, we modify our second example of 
Chapter 3 for distributed compression and show that Theorem 14.1.21 "beats" Proposi- 
tion |1?L31 That is, it shows the existence of protocols which allow the senders Ci and 
C2 to transfer their systems for free, a task impossible, for this example, for protocols 
of the kind described in the proof of Proposition I4.1.4[ Our last example considers a 
family of states for which smoothing has little effect on the min-entropies appearing 
in eq. KW) . 

4.2.1 Example I 

Recall the state ^c'lCaCs-R fQj~ ^j-^g second example of Section [3.2.41 

where |0)'^2C3 := y/X\00)^2'^i + Vl — A|ll)'^2*-^3 is a pure bipartite entangled state 
with entropy of entanglement: 

E{cl>) = S{Ci)^ = -A log A - (1 - A)log(l - A) > 0. 



101 



Chapter 4 



(4.25) 



Let's replace the EPR pairs |\l/_)'^^*^2 and |\E^_)*^3^ by maximally entangled states of 
dimension d and the state by a (ii-dimensional pure bipartite entangled state 

where \'d)'-^2'^3 := YlfLi with Aj > Aj+i and entropy of entanglement: 

di 

E{'d) = S{C^,)^ = - I] log A, > 0. 

1=1 

Let £"2, -E's) be an entanglement cost-tuple satisfying the requirements of 
Theorem I4.1.21 

El > + 41og(l/e) + 14 

E2 > -//„,i„(^^^^|^^) + 41og(l/e) + 14 
Ei + E2> -H^U^^'^^'V) + 41og(l/e) + 14 
Ei + E,> + 41og(l/e) + 14 

E2 + E,> + 41og(l/e) + 14 

Ei + E2 + E,> + 41og(l/e) + 14. 

As in our previous analysis of such states in Section 13. 2. 4^ we focus on the costs Ei 
and E2 sufficient for merging with fixed error e. By additivity of the min-entropy, we 
can simplify the min-entropies appearing in eq. fl4.25p for the costs Ei and E2: 

H,^Ui^'''''\^P'') = H^Ui^""') = log(rf) - log Ai(?9^') 

i/min(V^^^^^^|^'') = i/min(V^^') = - log Ai (^^^^ ) . 

The constraints on the costs {Ei,E2) become 

El > -log(rf)+41og(l/e) + 14 

E2 > log Ai(^?^2) - log(rf) + 41og(l/e) + 14 (4.26) 
Ei + E2> log Ai(?9^2) + 41og(l/e) + 14. 

For a fixed error e, the right hand side of the cost sum Ei + E2 will dominate the 
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coefficients 41og(l/e) + 14 if we adjust the min-entropy of the reduced state 
to be an increasing function of d. For the purpose of showing the superiority of 
Theorem 14.1.21 over Proposition I4.1.4t set ^^"^aCa to be the maximally entangled state 
of dimension c?^. This gives the following constraint on the cost sum Ei + E2: 

Ei + E2> -elog(d) + 41og(l/e) + 14, 

which is negative for large d and any fixed e > 0. Therefore, by boosting E^ enough 
to satisfy the other constraints of eq. f l4.25p . there exists a multiparty state merging 
protocol with error e and entanglement cost-tuple (-Ei, £'2, -E3) such that both Ei and 
E2 are negative. 

Turning to the entanglement costs provided by Proposition 14. 1 .4] let's attempt to 
achieve negative costs E[ and E'2 for the senders Ci and C2. This obviously requires 
the sender C3 to be the ffist to transfer his system. Thus, it restricts the possible 
ordering of the senders to either {6*3,(72, Ci} or {C3, Ci, C2}. Assuming C2 is the 
next sender to transfer his system, the cost E2 must be at least 

E2 > -i/L(C^2|C7i/2)^ + 41og(6/e) + 21og(13), 

where 5 := e^/468. By Lemma 5 of Renes et al. [73], which bounds the smooth min- 
entropy by the conditional von Neumann entropy and a function of the smoothing 
parameter 6, we have 

Hi^{C2\C,Rl^ < S{C2\CiR)^ + 8S{e + 1) log(rf) + 2/i2(25) 

= -log(ci) + elog(ci) + 86{e + 1) log(ci) + 2/i2(25) 

where h2{x) is the binary entropy function /i2(x) = — xlog(a;) — (1 — x) log(l — x). 
Hence, the entanglement cost E2 is at least 

- ^ - e) log(rf) - 2/12(25) + 41og(6/e) + 21og(13), 
468 

-e) log(rf) +41og(6/e) + 5 
e < 0.9. 

Ci is the next to transfer his system to the receiver, the cost 



^2 > (1 - — 
^ - ^ 468 

4e2 



> a 
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E[ must be at least 

E[ > -Hi^^{C^\C2R)^ + 41og(6/e) + 21og(13). 
By strong subadditivity of the smooth min entropy, we have 



-f^min(Cl|C'2-R)v - -f^mm(Cl|C'2C'2-R) 



< -^min(Cl|C'2)v 

where t^'^ is the maximally mixed state of dimension d and the last line is obtained 
by using the duality between the smooth min and max-entropies, eq. (12.211) . Using 
Lemma [A. 5. II with k = \d{l — 26)], we have 

^Lx(^''^)>21og^ 

= 21og(A;- 1) -log(rf) 

> log(rf) - 2. 

The last line follows from this very weak lower bound on k — 1, which is nonetheless 
sufficient for our purposes: 

k-l= \d{l - 26)] - 1 

= Lrf(i-25)J 
d 

> - 

- 2 

for any e < 2. Hence, the cost E[ for merging Ci is bounded by 

E[ > -<i„(Ci|C2i?)^ + 41og(6/e) + 21og(13) 

> ^Lx(^f ) + 41og(6/e) + 21og(13) (4.27) 

> log(rf) +41og(6/e) + 5, 

which is positive for any d and e > 0. Thus, Proposition 14.1.41 provides no entan- 
glement cost tuple {E[,E2,E'^) allowing negative costs for the senders Ci and C2. 
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The interpretation from Section 13.2.41 remains vahd in the one-shot regime for the 
multiparty merging protocols provided by Theorem 14.1.21 the excess of entanglement 
distributed between the receiver and the sender C3 is used by the decoder to transfer 
the other shares Ci and C2 for free, with possible extra entanglement shared with the 
receiver. 

4.2.2 Example II 

Our second example for illustrating the benefits of Theorem 14 . 1 . 2 1 over Proposition 
14. 1.41 considers the same kind of state ■^^C'lC'zCsR^ Y)ut with a less artificial state which 
does not depend on the "merging" error e: 



where = Xlj=i Vi is the dth harmonic number. These states are known as 
embezzled states and were introduced in van Dam et al. [71] . They are useful resources 
for channel simulation and other tasks [75l [76l [77] . 

First, to simplify calculations, we use the following bound on the dth harmonic 
number: 

ln(d+ 1) <Hd< \n{d) + 1. 

Let's compute a bound on the von Neumman entropy of the reduced density opera- 
tor ■t}'-''^: 



5(C^2). = -E-^l0g 



Md) + \og{Hd) 



ij=i 

1 log(rf) 
H, 2 



\oMhi{d} f Ahgd 
- 21n(rf+l) 

<^ + loglog(ci). 
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The fifth line was obtained using the bound ln((i) + 1 < ^ '"^ , holding for sufficiently 
large d. The maximum eigenvalue for the reduced state is 1/Hd, giving us the 
following constraint for the cost sum Ei + E2: 

Ei + E2>- \og{Hd) + 41og(l/e) + 14 

which is satisfied by choosing Ei and E2 such that 

El > -log(rf)+41og(l/e) + 14 
E2>- log log(rf) - \og{d) + 4 log(l/e) + 15 
Ei + E2>- log log(d) + 4 log(l/e) + 15. 

Thus, although loglog((i) grows very slowly with d, we can nonetheless choose costs 
El < and E2 < for sufficiently large d and any fixed value of e > 0. 

Now, for the protocols of Proposition 14.1.41 with the ordering {6*3,(72, Ci} se- 
lected for the senders, we can substitute the entropy S'(C|)^ by the bound of eq. fl4.28p . 
and obtain a lower bound on the cost E!^: 

E'^>{1- 165) \og{d) - S{Cl)i) - 2h2{25) + 41og(6/e) + 21og(13), 

- ^^/^ ' 117^ ^"^^"^^ ~ loglog(f/) + 41og(6/e) + 5. 

which is positive for any e < 2. For the other possible ordering {C3, Ci, C2} of the 
senders, the E[ cost is independent of the entropy of the state -d^^ (see eq. f l4.27p ). 
So once again, we have an example where the protocols of Theorem 14.1.21 allow the 
senders Ci and C2 the possibility to send their systems for free, although the di- 
mension d required for achieving this is unrealistic for this example. The protocols 
provided by Proposition I4.1.4[ on the other hand, allow no such feat for any value 
of d. 

4.2.3 Example III 

Suppose two senders Ci and C2 have systems Ci and C2 of dimensions d in a 
state of the form 
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These states are close relatives of the embezzling states introduced in the previous 
example. They make interesting examples because they have sufficient variation in 
their Schmidt coefficients that the i.i.d. state merging rates of Theorem 13.2.31 are not 
achievable in the one-shot regime. Simple teleportation of the two systems Ci and 
C2 to the receiver would require logd EPR pairs by sender [78]. Our protocols will 
yield nontrivial one-shot rates that are significantly better than teleportation. 

We assume that < a for i ^ j and try to express the lower bounds of 

eqs. (14.101) and (14.161) in terms of a. 

Protocols from Theorem 14.1.21 

Let [El, E2) be a pair of entanglement costs achievable according to Theorem l4.1.2[ 
The only constraints on the costs (aside from needing to be the logarithms of integers) 
are 

El > 
E2 > 
E1 + E2 > 

To begin, we will find a sufficient condition for the Ei constraint to be satisfied, so 
we need to evaluate Hj^i^{ip'^'^^\ip^). Let Amin be the smallest real number such that 
Amin(-^*"^ ® 'ip^) — ip^^^ > 0. Expanding the operators, the condition is the same as 

where 6ij is the Kronecker delta function. Let A > be any real number. By the 
Gershgorin Circle Theorem [7T1[72], the operator — is positive if each 

diagonal entry dominates the sum of the absolute values of the off-diagonal entries in 
the corresponding row. That condition reduces to 



-iJ^i„(^^^^|^A^) +41og(l/e) + 12 (4.29) 
-iJ^i„(^A^^^|^A^) +41og(l/e) + 12 (4.30) 
-i/^i„(^A^^^^^|V^^) +41og(l/e) + 12. (4.31) 
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holding for all i, which is true provided A — 1 > a J2'j=i VWi- But 

Therefore, if A > 2ad + 1, the operator A(J*^^ ® — ip^^^ is positive. Hence, we 
have 

-i7„,in(^/;^i^|V'^) = log A„,in < log(2arf + 1) < log(ad) + 2, 
provided a > ^. The lower bound of eq. fl4.29p is satisfied if we set 

El > log(arf) + 41og(l/e) + 14. 

The interpretation is that if the states are indistinguishable, then Ci holds 

the whole purification of R and must therefore be responsible for the full cost of 
merging. As the states {iV'i)} become more distinguishable, the purification of R 
becomes shared between the systems Ci and C2, allowing the entanglement cost to 
be more distributed between the two senders. Indeed, if a = 0{l/d), the lower bound 
on El becomes a constant, independent of the size of the input state {■ip)'-'^'-'^^ . 

Moving on to the E2 constraint, eq. ( I4.30p . the state jg classical quantum 
state with classical system R: 

Since the state \'ipj){ipj\'"^ is pure for all j, conditioning on the classical system reduces 
the min-entropy to zero. A formal proof of this fact is obtained by applying Lemma 
3.1.8 of Renner [59]. Thus, the lower bound of eq. fl4.30p is satisfied if we set 

>41og(l/e) + 12. 

For the sum rate Ei + E2, it is necessary to evaluate iJmin(^/''"^*"^^|V''^)- Since the 
state ■?/;<^i<^2-^ is pure, we can apply Proposition 3.11 in Berta [27] and obtain 

-H^iniip^^'^^^^li;^) = Ho{ip^'^^) := log rank (V^^^ ^ j^g^^ 
Hence, by Theorem 14.1.21 there exists a multiparty state merging protocol for the 
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state ijj'^^'^'^^^ with error e and entanglement cost pair {Ei,E2) satisfying 



E2 



El 



> 



> 



> 



log(ad) + 41og(l/e) + 14 

41og(l/e) + 12 

log(ci) +41og(l/e) + 12. 



(4.35) 
(4.36) 
(4.37) 



The total entanglement cost Ei + E2 must be at least log d plus terms independent of 
the dimensions of the systems, and we can distribute that cost between the senders 
Ci and C2. The lower bound on E2 varies independently with d and can be regarded 
as a small "overhead" for the protocol. There is a minimal d-dependent cost for Ei, 
however, which encodes the fact that if the sender C2 does not carry enough of the 
purification of R by virtue of the nonorthogonality of the (iV'i)}) then more of the 
burden will fall to the sender Ci. 

Protocols from Proposition 14.1.41 

Now let us consider the lower bound of eq. fl4.16p (Proposition 14. 1.41) . For fixed e, 
the proposition provides two cost pairs, plus others that are simply degraded versions 
of those two arising from the wasteful consumption of unnecessary entanglement. 
Proposition 14.1.41 does not permit interpolation between the two points, as compared 
to Theorem I4.1.2[ It might be the case, however, that Proposition I4.1.4f s freedom 
to smooth the entropy and vary the operator being conditioned upon could result in 
those two cost pairs being much better than any of those provided by Theorem 14.1.21 
On the contrary, for the states of the example, the improvement achieved with the 
extra freedom is minimal. 

Let (£'^,£'2) be a cost pair achievable by Proposition 14.1.41 For the purposes 
of illustration, consider the point with the smallest possible value of E2. Letting 
6 = e^/208, that point will satisfy 



First, to bound the entanglement cost E2, we use Lemma 5 of Renes et al. [73] : 



E[ > -i/^i,(^^^^^^|C2i?)+41og(4/e) + 21og(13) 
E2 > -i^;li,(V^^^^|i?)+41og(4/e) + 21og(13). 



(4.38) 
(4.39) 



i7;^iJ^^^«|i?) > -S{C2\R)^ - 86\og{d) - 2h2{26) 
= -85 log(rf) -2/12(25), 



109 



Chapter 4 



where S{C2\R)^ = for the state ■j/^'^^^. Thus, we have 

E'^ > -851og(d) -2/i2(2(5) +41og(4/e) + 21og(13). 

Before introducing the extra comphcation of smoothing, consider first ifmin(^'"^'"^^|C*2-R)- 
By duahty of the smooth min and max-entropies, eq. (12.211) . we have 



d 



> log — 1 - O 



> \o ( 

°^Vlogc?^ 

= log (i — log log (i + log 5, 

where 4/(ln(i+ 1) > 5.7/ logrf for sufficiently large d. Therefore, ignoring smoothing, 
the total entanglement cost for Proposition 14.1.41 satisfies 

E[ + E',>il- ^) log(rf) - loglog(rf) + 14 + 8 log (^^^ . (4.40) 

for sufficiently large d, which has worse constants than the sum cost (14.371) for The- 
orem 14.1.21 Now let us introduce some smoothing. By duality of the min- and max- 
entropies, 

- HUi''''^'''\RC2) = H^^i^P^^). (4.41) 
Lemma IA.5.11 of Appendix B gives that 

HL.H'''') > 21ogmin <^ V : k such that V -— - < 26 \ . (4.42) 

Getting a lower bound on this expression requires finding large k that nonetheless fail 
to satisfy the tail condition. That restriction on k is equivalent to 1 — Hk/Hd < 26, 
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which will not be met by any k small enough to obey 

k<{d + iy~^^/e (4.43) 

for sufficiently large d. 

k-l / 1 Z"^' 1 \^ 

a ^J^) (4.44) 

> log A; — log log d + log 5 (4.46) 



for sufficiently large k. Substituting in the largest possible k consistent with eq. (I4.43p 
and 6 = e^/208 gives 

E[ + E',>(^1-^-^^ log{d) - loglogd + 12 + Slog (^^^ , (4.47) 

for sufficiently large d. The additional savings from smoothing are only about ^iq^"^^ 
ebits, which is insignificant for small e. These tiny savings also come at the expense 
of being able to interpolate between achievable costs. To be fair, these states were 
chosen specifically because they are known to maintain their essential character even 
after smoothing, as was observed in [TH]. The freedom to smooth is certainly more 
beneficial for some other classes of states, most notably i.i.d. states. Indeed, since 
S{CiC2)ip = (log(i)/2 + 0(loglog(i), merging many copies of |'(^)C'i'^2-f^ can be done at 
a rate roughly half the cost required for one-shot merging. 
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Assisted Entanglement Distillation 



5.1 Introduction 



The protocols discussed in the previous chapters are based on a random coding 
strategy: the senders apply randomly chosen unitaries on their systems and perform 
projective measurements in a fixed basis. The decoder, conditioned on the measure- 
ment outcomes, applies an isometry on his systems and recovers the original state 
with arbitrarily good fidelity. A similar approach is used in [23] for solving the multi- 
partite entanglement of assistance problem when the state shared between the parties 
is pure. Recall that the formula is given by the min-cut entanglement of the state 



where the minimum is taken over all bipartite cuts T. (Recall that a bipartite cut 
consists of a partition of the helpers Ci, . . . , Cm into a set T and its complement 



Consider a one-dimensional chain with m repeater stations Ci, C2, . . . , Cm sep- 
arating the two endpoints (Alice and Bob). If many copies of the state = 
\/Ai|00) + -\/A2|11) are prepared and distributed across the network, so that the 
global state of the network is given by 




D^{ij^^'^^)=mmr{S{Ar)^}, 



(5.1) 



r={Ci,...,Cm}\r.) 



■ACiC2...CmB\(S)n 




then the previous formula applied to the state •qf^'~'i'-'2---CmB j-g^j^^gg the entropy 
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of entanglement S{A)^ of the state If the fiber optic transmitting the quantum 
information is perfect up to distances of roughly 100 kilometers, then we can establish 
close to nS{A)^ ebits between Alice and Bob no matter how far they are to each other 
by introducing repeater stations at approximately every 100 kilometers. Of course, 
as the number of repeater stations increases between Alice and Bob, more copies of 
the state must be distributed between the nodes of the network for the assisted 
distillation protocol to continue producing high quality entanglement at the rate 

As discussed in the introductory chapter, an implementation of the previous 
strategy will have to deal with the accumulation of errors during the various phases 
of preparation, distribution, storage and local operations of the quantum information. 
In this chapter, we continue the theoretical analysis of this problem by extending the 
models previously studied in [801 ED] to allow for an arbitrary mixed state between 
adjacent nodes. This is an initial step towards handling more complex and realistic 
situations. First, we will consider a network consisting of two receiving nodes (Alice 
and Bob), separated by a repeater node (Charlie), whose global state is a mixed state 
^ABC 'jj^ig ig realistic assumption, as we recall imperfections in local operations 
[13] and decoherence in the quantum memories will most likely introduce noise in the 
stored qubits. We study the optimal distillable rate achievable for Alice and Bob when 
assistance from Charlie is available. This problem reduces to the two-way distillable 
entanglement for states in a product form ip'^ ® ip^^- There is currently no simple 
formula for computing the two-way distillable entanglement of a bipartite state ip^^, 
which has been studied extensively by Bennett et al. and others in [38], EH ESI [83] . 
We do not attempt to solve this problem here, and turn our attention instead to 
good computable lower bounds for assisted distillation of mixed states. We provide 
a bound which exceeds the hashing inequality for states which do not saturate 

the strong subadditivity of the von Neumann entropy and allow the recovery of the 
C system if Alice and Bob can perform joint operations on their systems. 

5.2 Assisted distillation for mixed states 

5.2.1 The task 

In this section, we extend the entanglement of assistance to the case of a general 
mixed state tp^^'^: a measurement of Charlie's system followed by an entanglement 
distillation protocol between Alice and Bob. The problem is illustrated in Figure ISTTl 
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Figure 5.1: Quantum circuit representing a broadcast assisted entanglement distillation 
protocol. Solid lines indicate quantum information and dashed lines classical 
information. Charlie first performs a measurement, sending copies of the 
classical outcome to Alice and Bob. Alice and Bob then implement an LOCC 
operation, conditioned on that classical outcome. 

Problem 5.2.1 (Broadcast, Assisted Distillation). Given many copies of a tri- 
partite mixed state ip^^'^ shared between two recipients (Alice and Bob) and a helper 
(Charlie), find the optimal distillable rate between Alice and Bob with the help of 
Charlie if no feedback communication is allowed: Charlie performs a POVM and 
broadcasts the measurement outcome to Alice and Bob. The optimal rate is denoted 
by D^i^ip"^^^) . It is the asymptotic entanglement of assistance. 

We call a protocol which satisfies the constraint of Problem 15.2.11 a broadcast 
assisted distillation protocol. More formally, it consists of 

1. A POVM E = (Er^)-^^^ for Charlie. Without loss of generality, we can assume 
that the operators E^ are all of rank one. 

2. For each x, an LOCC operation Vx '■ A"'B"' — AiBi, where Ai and Bi are 
subspaces of A" and 5" of equal dimensions, implemented by Alice and Bob. 

We refer to a broadcast assisted protocol as an (n, e)-protocol if it acts on n copies of 
the state ■?/'^^*^ and produces a maximally entangled state of dimension M„ := 

^ M„ 
V m=l 
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up to fidelity 1 — e: 

X 



where 



j A"B" ^ I rp 



A real number i? > is said to be an achievable rate if there exists, for every n 
sufficiently large, an (n, e)-protocol with e — )• and ^ logM„ — i? as n — )■ oo. Lastly, 
we have 

(V'^^^) := sup{i? : R is achievable}. 

The restriction to POVMs with rank one operators in the preceding definition 
can be justified as follows: any POVM F containing positive operators with rank 
higher than one that Charlie would wish to perform can be simulated by a POVM 
E with rank one operators on Charlie's system followed by some processing by Alice 
and Bob. More precisely, suppose Charlie wants to perform a POVM F = {F^} on 
his state with some operators having rank greater than one. Consider the spectral 
decomposition of each operator: 



where {|af)} are eigenvectors of Fr^ with eigenvalues {Af}. Then E = {Af |Q;f)(af 
is a POVM with rank one operators. Instead of performing the POVM F, Charlie 
does a measurement corresponding to the POVM E. After Alice and Bob receive the 
measurement outcome, the state is given by 



A2B2 



To simulate F being performed by Charlie, Alice and Bob can trace out the A2 and 
B2 systems. The state becomes 



^AA^BB, ^ J^Pxi^x"" ® \XX){XX\ 



AiBi 



with ip^^ := ^ J2iQx,i'ipx,f px = J2i1x,i- Observe that this preprocessing can be 
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embedded within the LOCC operation Vx- Hence, there is no loss of generahty in 
assuming POVMs with rank one operators in step 1 of the protocol. 

For pure states, D'^ltp) reduces to the asymptotic entanglement of assistance 
E^[ip). For product states of the form ip'~^ ip^^ , D^[ip) is equivalent to the two- 
way distillable entanglement D{ip^^). A formula is known for the two-way distillable 
entanglement (see Theorem 15 in Devetak and Winter [29]), but its calculation is 
intractable for most states. We will instead use the hashing bound to the one-way 
distillable entanglement D^{ip^^) p9], which is much easier to evaluate. We remind 
the reader of the result for convenience: 

Lemma 5.2.2 (Hashing inequality [381 [29]). Let ij)^^ he an arbitrary bipartite mixed 
state. Then, 

D^(^^^) > S{B)^ - S{AB)^ =■ I{A)B)^. (5.2) 

5.2.2 Entanglement of assistance 

As mentioned before, it was shown in [23] that for pure states, the operationally 
defined quantity corresponds to the regularization of the one-shot entanglement of 
assistance E^. In a similar fashion, we define the one-shot entanglement of assistance 
Da{'>P^^'") of a tripartite mixed state ip^^^ and show that its regularization is equal 
to D^iyip^^'^). We then look at some of the properties of Da{iP^^'")- 

Definition 5.2.3. For an arbitrary state tp"^^^ , define 



sup lj2p.D{i;^' 



i,^"" = -TicliEx ® Jab)^^''^] } , (5.3) 

Px 



where p^ = Tt[ExiI''"] O'nd the supremum is taken over all POVMs E = {E^} with 
rank one operators on Charlie's system C. 

The quantity D^i'ip^^'") can also be characterized using a maximization over all 
pure state decompositions {pi,ipf^^} of the purified state ip^^^^-. 

Proposition 5.2.4. Let ip^^'~^ be an arbitrary state, with purification ip^^'^^ ^ then 

DA(t^^^^)= sup 5^p.D(^,^^), (5.4) 

where the supremum is taken over all ensembles of pure states {pi^^jf^^} satisfying 
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Proof Any rank one POVM on C induces an ensemble of pure states on ABR 
with average state and for every such ensemble there exists a corresponding 

POVM [nS]. Applying this observation to the definition of the one-shot entanglement 
of assistance yields the result. □ 

We can interpret eq. (15. 4 p as follows: by varying a POVM on his state, Charlie 
can collapse the purified state ip^^'^^ into any pure state ensemble decomposition 
{pi, tpf^^} for the A,B, and R systems. Since we don't have access to the purifying 
system R, the quantity Da{'iP^^'") maximizes the average amount of distillable entan- 
glement between Alice and Bob. The next result shows that the regularized version 
of Da is in fact equal to the asymptotic entanglement of assistance D'^^tp!"^^'^). 

5.2.3 Basic properties 

Theorem 5.2.5 (Equivalence). Let ip^^^ he an arbitrary tripartite state. Then the 
following equality holds: 

D^{^^^^) = lim -Da( (^^^^)®"Y (5.5) 

n.-5>oo n \ J 

Proof We demonstrate the "<" first. Consider any achievable rate R for a broadcast 
assisted protocol. By definition, there exists, for every n sufficiently large, an (n, e)- 
protocol with e — and ^ log(M„) — i? as n — oo. For a protocol working on 
n copies of the state ip^^'~^ , denote Charlie's POVM hj E = (-E^;);^^^, and for each 
outcome x, the LOCC operation implemented by Alice and Bob by V^- Write 



x=l 



x=l 



where p^ = Tr[E^(7/^^)®"] and i^f^'^ = j^TicAiE^ ® lAB)i^ABc]- The state Q^^^^ is 
the output state of Vx{ip^"^")- By hypothesis, we have 
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which, shifting to the trace norm, imphes 



AiBi 



< 2v^ := e'. 



(5.6) 



The trace distance is non-increasing under the partial trace, and so tracing out the 
Ai system, we have 

" ~ " <e', (5.7) 



where = ^ ^;^:Jm)(m|^^ 

We can apply the Fannes inequality (Lemma I2.2.3P on eqs. (15. 6 p and (15. 7p to get 
a bound on log(M„) in terms of the coherent information of the state QaiBi- 

logM„ < S{B^)n - S{A^B^)n + 3 log(M„)r7(e') 
= /(Ai)5i)n + 31og(M„)r/(e'), 

where ?7(e') is a function which converges to zero for sufficiently small e'. (The def- 
inition of r]{e') can be found in Lemma [2.2.31 ) Using the convexity of the coherent 
information [H^, the hashing inequality, and the definitions of D and Da, we get the 
following series of inequalities: 

logM„ < I{A^)B,)n + 31og(M„)r;(e') 

< 5^p,./(Ai)fii)n. +31og(M„)77(e') 

X 
X 

<5^p.D(^f^") + 31og(MOr/(6') 

X 

<D^((^^^^)®") + 3nlog(rf^)r/(6'). 

Since e — > and ^ \og{Mn) — > -R as n — > oo, the achievable rate R is at most 
lim„_j.oo ^-Da(('^"^^*")®"'), which proves the "<" part since R was arbitrarily chosen. 

To show the ">" part, suppose Charlie performs any POVM E = (E^) on one 
copy of the state ip^^'^ and broadcasts the result to Alice and Bob. They now share 
the state 



A' ABB' 
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Since Alice and Bob know the outcome of Charlie's POVM, the distillable entangle- 
ment of ip^'^^^' is at least 

To see this, consider many copies of and let Alice and Bob perform projective 

measurements on the systems A' and B' for each copy of the state. Group the outcome 
states into blocks, where each block corresponds to a specific measurement outcome. 
For each of these blocks, there exist LOCC operations Vx which will distill arbitrarily 
close to the rate D{ip^^). Thus, there is a protocol achieving the rate '^^PxD^ip^^), 
which proves the ">" part. □ 

Finding a formula for the one-shot quantity Da{iP^^^) appears to be a difficult 
problem, and so we look for upper bounds which are attained for a subset of all 
possible states. For the remainder of this section, we look at two upper bounds and 
give examples of states attaining them. 

Proposition 5.2.6. Let ^^^'^ he an arbitrary tripartite state. We have the following 
upper hound for Da{'iP^^'") 



D^(^^^^)<inf^p,i?^(^,^^^^ 



I 



where the infimum is taken over all ensembles of pure states {pi,ipf^'"} such that 



Proof Let ifj^^^ = YliPi'^f^'" ^ where the states ipf^'^ are pure. Consider the 
following classical-quantum state (j)^^^^ = '^^Piipf^'^ ® \i){i\^. If Charlie is in 
possession of the X system, then 



lABCX\ 



by the definition of Da- By the convexity of DA{(f)^^'"^) on the ensemble {pi, ip^^'-^ ' 



){i\^} (see Proposition lAH]) and the fact that ® = 

we have 

< Z^a(0^^^^) < Y.P^EaO^^''^ (5.8 
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Since this holds for any pure state ensemble {pi,'ipf^'"}, we arrive at the statement 
of the proposition. □ 

With this result in hand, we now exhibit a set of states for which we can compute 
the value of Dj^ exactly. 

Example 5.2.7. Consider the following family of classical- quantum states, with clas- 
sical system C: 

i=l 

where tpf^ are pure states. Since Da is convex on pure ensembles {pi,4'f^'"} , the 
quantity D^i'ip'^^'^) is upper bounded by YliPi^Ai'ipf^ ® Since assistance is 

not helpful for a product state ip^^ , we have that D^iipf^ ^ = ^{i't^) = 

S{A)^^. By considering the POVM E = {\i){if}i^i, we also have Da{Sj^^^) > 
YliPi^ii^f^) — 'l2iPiS{^)tp' ■ Hence, for this special class of classical- quantum states, 
the upper bound is attained and Da is just the average entropy of the A system for 
the ensemble {pi,ipf^}- 

Proposition 5.2.8. Let ip^^'~^ be an arbitrary tripartite state. Then 

Proof From Proposition 15.2.61 and the concavity of the entanglement of assistance 
quantity Ea (see [21] for a proof), we have 

D^(^^^^)<inf 

i 

= inf 

i 

<mf EA{J2p.i^r) 

i 

= EAir^") 

where the minimization is taken over all pure state ensembles {pi, ipf^'"} of the state 

The previous bound on Da is better understood by imagining the following sce- 
nario. The A' system of a pure state ip"^^' is sent to a receiver (i.e Bob) via a noisy 
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channel N", which can be expressed in its Stinespring form as M{ip) = TteUpW , 
where U : A' ^ BE is an isometry. Another player, Charlie, tries to help Alice 
and Bob by measuring the environment and sending its measurement outcome to 
Alice and Bob. Two cases can occur. If Charlie has complete access to the envi- 
ronment, the best rate Alice and Bob can achieve is given by the entanglement of 
assistance EaO^'^^'")- More likely, however, is the case where Charlie will only be 
able to measure a subsystem Ci of the environment E = C1C2. In this situation, the 
optimal rate is given by the one-shot entanglement of assistance D^i'ip'^^'^^), where 
^ABCi _ T^Y^^^ip^BE _ Since this case is more restrictive to Charlie in terms of measur- 
ing possibilities, it makes sense that Da{4'^^'") < E^ii^'^^'") for any tripartite mixed 
state ■ip^^'-' . This bound will be attained for all pure states ■j/;^^'^ since Da reduces 
to Ea in this case. 

5.3 Achievable rates for assisted distillation 

In this section, we find the rates achieved by a random coding strategy for as- 
sisted entanglement distillation. The helper Charlie will simply perform a random 
measurement in his typical subspace. In light of the equivalence demonstrated in the 
previous section, eq. (15.51) . we will prove a lower bound on the asymptotic entangle- 
ment of assistance by bounding the regularized entanglement of assistance quantity. 
We will use a much simpler form of Proposition 13.2.51 

Proposition 5.3.1. f24\l Suppose we have n copies of a tripartite pure state tp^^^, 
where S{R)^ < S{B)^. Let ip^^^ he the normalized state obtained by projecting 
C"", -B"", i?"" into their respective typical subspaces C, B, R. Charlie performs a projec- 
tive measurement using an orthonormal basis {\ei)'"} ofC chosen at random according 
to the Haar measure. Denote by pi the probability of obtaining outcome i. Then, for 
any e > 0, and large enough n, we have 




where ijjf' is the state of the system R upon obtaining outcome i. The average is taken 
over the unitary group U(C) using the Haar measure. 

Proof To choose a random orthonormal basis {\ei)^}'l^i, let \ei)^ = U\i)^ , where U 
is a Haar distributed unitary on U(C') and is the computational basis on C. 
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A measurement in the basis {|ej)*"} is equivalent to applying the unitary W on the 
C system, followed by a projective measurement in the computational basis. Hence, 
we can apply Proposition 13.2.51 with m = 1, L = 1 , and K = 1. This gives us 



Jv{c) i "c V 



^RC 



Since Tr[^/;|^] = Tr[?/^|], we can use the properties of typicality found in eqs. fl2.18p 
and fl2.19p to simplify the last equation: 



U(C) 



^ - - 2l~"(^(^)'^-'5) J2-n{S{B)^-S{R).^-A6) 

Y,pM - ^%dU < + 2 ^-^ , (5.9) 



where S and ( can be made arbitrarily small by choosing n large enough. If the 
condition S{R)^ < S{B)^ is satisfied, the right hand side of eq. (15. 9p vanishes (i.e 
can be made less than any e > 0) as n grows larger. □ 

In Horodecki et al. [23], the previous proposition was used to study assisted 
distillation of pure states. The following theorem generalizes the reasoning used there 
to the mixed state case. The lower bound on the rate at which ebits are distilled, 
involving the minimum of I{AC)B)^ and I{A)BC)^, suggests that C is merged either 
to Alice or Bob, at which point they engage in an entanglement distillation protocol 
achieving the hashing bound. This need not be the case, however. In the discussion 
following the proof of the theorem, we will exhibit an example where merging is 
impossible but the rates are nonetheless achieved. 

Theorem 5.3.2. Let ip^^'^ be an arbitrary tripartite state shared by two recipients 
(Alice and Bob) and a helper (Charlie). Then the asymptotic entanglement of assis- 
tance is bounded below as follows: 

D^iij^'''') > max{/(A)5)^, (5.10) 

where L{tP) := mm{I{AC)B)^, I{A)BC)4,}. 

Proof That D'^{ip^^'^) is always greater than or equal to the coherent informa- 
tion I{A)B)^ follows from the hashing inequality and the fact that Charlie's worst 
measurement is no worse than throwing away his system and letting Alice and Bob 
perform a two-way distillation protocol without outside help. Hence, it remains to 
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show that > mm{I{AC)B)^,I{A)BC)^}. 

Since D'^{iIj^^'-^) is equal to the regularization of Da{iP^^'^), we only need to 
show the existence of a measurement for Charlie for which the average distillable 
entanglement is asymptotically close to L^ip). We prove this fact via a protocol 
which uses a random coding strategy. The state ip^^^ and its purifying system R 
can be regarded as: 

1. a tripartite system composed of C, AB, and R. 

2. a tripartite system composed of C, AR, and B. 

Let's consider n copies of ■ip'^^'-^ , and furthermore, let's assume that S{AB)^ (resp. 
S{AR)^) and S{R)^ (resp. S{B)^) are different. This can be enforced by using only 
a sub-linear amount of entanglement shared between chosen parties in the limit of 
large n. After Schumacher compressing his share of the state V^§", Charlie performs 
a random measurement of his system C. Let J be the random variable associated 
with the measurement outcome and let ip^"^"^" be the state of the systems A", B^ 
and -R" after Charlie's measurement. By Lemma IA.6.21 and the Fannes inequality, 
there exists a measurement of Charlie's system which will produce a state ^j^"^"^" 
satisfying, with arbitrarily high probability: 



S{A^B^)^^ = S{R^)^^ = n{mm{S{AB)^, S{R)^} ± S) 
= 5(5")^, = n{mm{S{AR)^„ S{B)^} ± 6), 



(5.11] 



where 6 can be made arbitrarily small by choosing n large enough. Applying the 
hashing inequality to such a state will give: 

= n{mm{S{B)^, S{AR)^} ±6)- n{mm{S{AB)^, S{R)^} ± 6) 
> n(min{S(fi)^, SiAR)^ - SiR)^ - 26) 
= n{mm{I{AC)B)^„I{A)BC)^} - 26). 

For each outcome j, define Xj to be the variable taking the value zero if 
satisfies eq (15. lip , or one otherwise. The average two-way distillable entanglement 
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for this measurement will be at least 

j X,=0 Xj=l 

> P{Xj = 0)n{mm{I{AC)B)^,I{A)BC)^} - 26) + vMi'f''") 

Xj=l 

> (1 - a)n [mm{I{AC)B)^, I{A)BC)^} - 26] , 

where a can be made arbitrarily small by taking sufficiently large values of n. Finally, 
we have 

j 

>(!-«) [mm{I{AC)B)^„ I{A)BCy^} - 26] . 
Since a and 6 can be chosen to be arbitrarily small, we are done. □ 

Corollary 5.3.3. Let ip^^'^ be an arbitrary tripartite state shared by two recipients 
(Alice and Bob) and a helper (Charlie). Then the asymptotic entanglement of assis- 
tance is bounded below as follows: 

^a(^^''^) > lim -supyp,L(af ^"^), (5.12) 

i 

where the supremum is over all instruments X := {Si} performed by Charlie, with 
^A^B^c ^ l-iid^-^"0£,){^%^) andp, = TT[£,i^n 

Proof First, to see that the maximization of eq. f lS.lOp can be removed, consider 
an instrument which traces out the C system: a^^ = Ti cip^^'"- Then, both 
coherent information quantities in L{a) reduce to the coherent information I{A)B)^. 
Achievability of the rate YliPi^i'^f^'")^ instrument Z performed on n copies 

of ip^^*^ , follows by considering a blocking strategy. □ 

Let's look into some of the peculiarities of the previous results. First, observe that 
the right hand side of eq. (15.101) is bounded from above by the coherent information 
I{A)BC)^. This follows from the definition of L{tlj), and the strong subadditivity of 
the von Neumann entropy, expressed in terms of coherent information quantities as: 



I{A)BC).^ > I{A)B)^. 
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When the lower bound of eq. (15.101) is equal to I{A)BC)^, we have I{A)BC)^ < 
I{AC)B)^, which implies by further calculation that I{C)B)^, > 0. Suppose that 
I{C)B)^ > and consider n copies of the purified state (^^^c'-R)®"^ written as 
^C"-B"R'i where Ri := AR is the relative reference for the helper C. State merging 
(see Chapter 3) tells us that a random measurement on the typical subspace C, 
as described in our protocol, will decouple the system from its relative reference i?", 
allowing recovery of C" by Bob up to arbitrarily high fidelity. Our assisted distillation 
protocol can be improved for this case by recovering the C" system at Bob's location 
before engaging in a two-way distillation protocol, which will now act on the state 
^^ABB-^^n^ where B is an ancilla of the same dimension as the C system. Since the 
distillable entanglement across the cut A vs BC cannot increase by local operations 
and classical communication, the previous strategy is in fact optimal. A small amount 
of initial entanglement between and 5" may be needed if I{C)B)^ = (see |24j). 

The previous analysis may lead us to believe that when the lower bound of 
eq. (I5.10p is equal to I{AC)B)^, a similar strategy of transferring the system C 
to Alice could be applied. However, the following counterexample will show that this 
is not always true. Let 

\^^BC,R ^ 1 1000)^^2^ + 1|110)^^^^ + -|111)^^^« and 

= ^|00)^^^ + y^lll)^^^ 

Alice and Bob are to perform assisted distillation on n copies of ijj^^^^'^'^ = ip^'^'^ (g) 
'0^C'2 with the help of a single Charlie holding both the Ci and C2 systems. Such 
a situation could arise in practice if Alice had a high-quality quantum channel to 
Charlie but Charlie's channel to Bob were noisy. The system R would represent the 
environment of the noisy channel. In this case, L{ip) is equal to I{AC)B)^, which 
is easily calculated to be approximately 0.40, since I{A)BC)^ ~ 0.81 and I{A)B)^ 
is negative. For this example, the achievable rate of our random coding protocol 
is therefore at least the rate that could have been obtained by a strategy of first 
transferring the state of the C system to Alice, followed by entanglement distillation 
between Alice and Bob at the hashing bound rate. However, the coherent information 
I{C)A)^ is negative for the state ip^^cn'^iR^ gy optimality of state merging, the 
state transfer from Charlie to Alice cannot be accomplished without the injection 
of additional entanglement between them. Therefore, the protocol achieves the rate 
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I{AC)B)^ without performing the Charlie to Ahce state transfer. 

This example also illustrates a general relationship between hierarchical distil- 
lation strategies and the random measurement strategy proposed in this chapter. 
A hierarchical strategy for a state ■^^^^c'lCa _ ^ ^c'a^ ^q^j^j consist of first 

distilling entanglement between A and Ci as well as between C2 and -B, followed 
by entanglement swapping to establish ebits between Alice and Bob. If the first 
level distillations are performed at the hashing rate, then this strategy will establish 
min[/(y4)Ci)^, /(C2)-B)^] ebits between Alice and Bob per copy of the input state. 
On the other hand, the random measurement strategy will establish at least L{ip), 
which in the case of the example is the minimum of 

I{AC)B)^ = I{C2)B)^-S{ACi)^ = I{C2)B)^and 
I{A)BC)^ = 

yielding exactly the same rate as the hierarchical strategy. (The first line uses the fact 
that ■ip^'^'^ is pure.) So, for the random measurement strategy to beat the hierarchical 
strategy, it is necessary that the state not factor into the form ip^'-''^ (g) ip^^^ . As an 
example, consider modifying the state of eq. fl5.13p by applying a CNOT operation 
between the systems Ci and C2 held by Charlie: 



CNOT 



A CNOT operation can be used to model phase dampening effects between an input 
state (i.e the C2 system) and its environment (i.e the Ci system). If the control qubit 
is the system Ci and the target qubit is C2, the previous state transforms to: 





1 











\ 







1 
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/ 



V V2 

The reduced state cff^ of the Ci system is equal to: 

:=i|0)(Or + ^|l)(ir, 
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and the reduced state ^ of the system ACi is given by 

:= l|00)(00|^^i + 

Hence, the coherent information I{A)Ci)fp is zero, yielding a null rate for the hier- 
archical strategy. On the other hand, the quantities I{AC)B)^ and I{A)BC)^ are 
equal to the coherent informations I{AC)B)^ and I{A)BC)^,. Thus, the random 
measurement strategy is tolerant against a CNOT "error" on Charlie's systems, as 
opposed to the hierarchical strategy, which fails to recover from this error. 

Finally, it is easy to determine conditions under which the random measurement 
strategy for assisted entanglement distillation will yield a higher rate than the hashing 
bound between Alice and Bob. As the next result shows, a state ip^^^ is a good 
candidate for the random measurement strategy if it does not saturate the strong 
subadditivity inequality of the von Neumann entropy, and if the C system can be 
redistributed to Alice and Bob provided they are allowed to perform joint operations 
on their systems. 

Proposition 5.3.4 (Beating the Hashing Inequality). For any state ip^^^ , the value 
of L{tp) is positive and strictly greater than the coherent information I{A)B)^ if 

I{C)AB)^ > and S{A\BC)^ < S{A\B)^. 

Proof The inequality S{A\BC)^ < S{A\B)^ can be rewritten as 

I{A)BC).^, > I{A)B)^, 

and the condition I{C)AB)^ > as 

S{AB)^ > S{ABC)^. 

By negating and adding S{B)^ on both sides of the previous inequality, we get back 

IiA)B)^ := SiB)^ - S{AB)^ < S{B)^ - SiABC)^ =: J(AC)5)^. 

□ 
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c 1C2 C^AB 




Figure 5.2: Quantum circuit representing a broadcast assisted entanglement distillation 
protocol involving three helpers. The three helpers perform their measure- 
ments, sending copies of the classical outcomes to Alice and Bob. Alice and 
Bob then implement an LOCC operation, based on that outcome. 



5.4 Multipartite entanglement of assistance 

In this section, we look at the optimal rate achievable when many spatially sepa- 
rated parties are assisting Alice and Bob in distilling entanglement. First, we extend 
the one-shot entanglement of assistance Da (Definition I5.2.3P to arbitrary multipar- 
tite states ^c'lCa-.-^B^ henceforth written simply as ip'^^AB ^ The type of protocols 
involved is depicted in Figure 15.21 

Definition 5.4.1. For a general multipartite state ^'^mAb ^ consider POVMs Ei, . . . ,E„ 
performed by {Ci,C2, ■ ■ ■ ,Cm} respectively which lead to a (possibly mixed) bipartite 
state 4'ki^^...km f^''^ POVM outcomes k := kik2 ■ ■ ■ km- We define the multipartite en- 
tanglement of assistance as 

k 

where the supremum is taken over the above measurements. The asymptotic multi- 
partite entanglement of assistance D'^^ip'^'''^^) is obtained by regularization of the 
above quantity D'^i'ip^^'^^) = lim^^oo ^DAi'ip'^''). 

For a pure state iP^mAB ^ jiximediate that the maximization in the preceding 
definition is attained for POVMs of rank one, leading to an ensemble of pure states 
{OkJ V^'^}- And so, D'^{ip'^^''^^) reduces to the asymptotic multipartite entanglement 
of assistance for pure states. 
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Proposition 5.4.2. Let iP^mAB arbitrary multipartite state. The quantity 

D^{ip'~^'^'^^) is hounded from above by the following quantity: 

where the minimum is over all bipartite cuts T and ip^'^\^'^ is a bipartite state with 
Alice holding the systems AT and Bob holding the systems BT. 

Proof Consider any cut T of the helpers {Ci, C2, ■ ■ ■ , Cm} and suppose Ahce (resp. 
Bob) is allowed to perform joint operations on the systems AT (resp. BT). Any 
protocol achieving D'^[ip'~^^'^^) consists of: 1) POVMs on the helpers followed by 
a transmission of the outcomes to Alice and Bob 2) local operations and classical 
communication between the systems A and B. This kind of protocol is contained in 
protocols allowing local operations on the systems AT and BT and classical commu- 
nication between the cut AT vs BT. Since the distillable entanglement across the cut 
AT vs BT cannot increase under local operations and classical communication, the 
optimal achievable rate for these protocols is given by D{il)^'^\'^'^). Since this holds 
for any cut T of the helpers, we are done. □ 

Definition 5.4.3. For an arbitrary multipartite state ^'^mAb ^ define the minimum 
cut coherent information as: 

T^,^iij,A:B) ■.= mmrI{AT)BT)^, 

where the minimization is over all bipartite cuts T C {Ci, C2, . . . , Cm}- 

Theorem 5.4.4. Let ip'-^^AB arbitrary multipartite state. The asymptotic mul- 

tipartite entanglement of assistance D^i^fj'-^''''^^) is bounded below by: 

D^^^c^rAB^ > max{I{A)B)^, r^.^i^jj, A:B)}. (5.14) 

Before giving a proof of Theorem [5331 we need the following lemma, which states 
that the minimum cut coherent information of the original state is preserved, up to 
a vanishingly small perturbation, after a helper has finished performing a random 
measurement on his system. The arguments needed for demonstrating this lemma 
are similar to those used to in [21] to prove eq. 05. ip . 
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Lemma 5.4.5. Given n copies of a state ^P'^mAB ^ perform a random mea- 

surement on his typical suhspace Cm as in Proposition \5.3.1i For any 6 > and n 
large enough, there exists a measurement performed by the helper Cm such that, with 
arbitrarily high probability, the outcome state ipj ^ " '""^ satisfies the following 
inequality: 

FmMj, : 5") > n{r^M, A:B)-5), 

where J is the random variable associated with the measurement outcome. 

Proof The minimum cut coherent information I^^^{ip,A : B) of the state %jj^MAB 
can be rewritten as 

: B) = minrc{c„c„...,c„}{^(Sr)4 - S{R)^, 

where R is the purifying system for the state ip'^^i^^ , 

Let T be a bipartite cut of the helpers {Ci, C2, . . . , Cm,} such that Cm ^ T. 
We define its relative complement as T' = {Ci, . . . , Cm-i}\T. For any such cut T, 
the state iJj'^mAb -^.g purifying system R can be regarded as a tripartite system 
composed of Cm, ART and BT' . Assuming S{ART)^ and S{BT')^, to be distinct, 
the helper Cm performs a random measurement on his typical subspace Cm- By 
Proposition 15 . 3 . ll and the Fannes inequality, there exists a measurement for Charlie's 
system for 
probability: 



system for which the outcome state ipj^ " """^ satisfies, with arbitrarily high 



mm{S{ARTh, S{BT%} - 6' < -^(5"r'")^, < mm{SiARTU, S{BT\,} + 5', 

(5.15) 

where 5' can be made arbitrarily small by taking sufficiently large values for n. Hence, 
the reduced state entropies stay distinct by taking a sufficiently small value of 5' . Since 
IminiS^J, : -B") can be re-expressed as 

Pm^^j^A- : B^) = minrc{c„c„...,c™_.}{5(i?'^r")^,} - (5.16) 
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we can substitute the lower bound for S{B'^T'^)^j into eq. (15.161) and obtain 

I'miS^j, A" : S") > nminr(min{^(Ai?r)^, S{BT%} - 5') - S^K")^, (5.17a) 
= n{minr{S{BrCm)^, S{BT%} - 6') - S{R^)^^ (5.17b) 
= n{mmr{S{Br)} - 6') - S{R')^j. (5.17c) 

To finish the proof, the last fact we need concerns the entropy of the purifying 
system R. If we consider the purified state ifj'^^i^^^ as a tripartite system composed 
of Cmi R and ABCi, . . . , C^-i, we can apply Proposition 15.3.11 and obtain, w.h.p: 

S{R^)^j = n{mm{S{R)^, S{C^, . . . , C^^^AB)^} ± 6") ^ ^ 

(5.18) 

<n{S{R)^ + 6"), 

where 6" can be made arbitrarily small. This tells us that for large values of n, the 
entropy of the purifying system will not significantly increase as a result of the helper 
Cm performing a measurement on his typical subspace Cm- Note that, as in Theorem 
15.3.21 we can use the union bound and Markov's inequality (see Lemma IA.6.2P to 
show the existence of a measurement on Cm which produces states such that, w.h.p, 
eqs. (15.15l) and (I5.18P are both satisfied. Combining the last equation with eq. (I5.17cl) 
and choosing values for 6', 6" small enough that S' + 6" < 6, we get the desired result. 

□ 

Proof of Theorem 15.4.41 The right hand side of eq. (I5.14p is just the coherent 
information when m = 0, and is equal to ma.x{I{A)B)^, L{ip)} for m = 1. Eq. (15.141) 
holds for these base cases by the hashing inequality and Theorem l5.3.2[ So, from here 
on, assume m > 2. Moreover, that D'^{ip'-'^'^^) is at least I{A)B)^ follows again 
from the hashing inequality. Hence, we can focus on proving that D'^{iIj'^'^'^^) is 
bounded below by the minimum cut coherent information. 

By Lemma 15.4.51 there exists a measurement Em for the helper Cm which pro- 
duces an outcome state V'j^ " """^ satisfying w.h.p. the following inequality: 

F^Mj, a- : S") > n{r^^{^, A:B)-5), 

for an arbitrary small 5 and sufficiently large n. If we have at our disposal rt^ copies 
of the state ip^^AB perform the measurement Em for each block of n copies of the 
state, we expect to obtain approximately n"^~^pj copies of the state tpj, with pj being 



131 



Chapter 5 



the probability of obtaining the state after the measurement Em is performed by 

Cm- 

For each block consisting of many copies of the state ipj, we repeat the previous 
procedure in a recursive manner. We continue this process until all m helpers have 
performed measurements on their systems. In the end, Alice and Bob will obtain a 
number of bipartite states each satisfying w.h.p. 

where 5' can be made arbitrarily small. Observe that the term on the left hand side 
of the inequality is the coherent information I{A)B)^ab , which is bounded above 
by the distillable entanglement D{ipf^^ j^). Since is a supremum over all 

LOCC measurements performed by the helpers, we have 

jlj2---jm 

>il-emM,A:B)-6'), 



where e and 6' can be both be made arbitrarily small by the arguments of the previous 
paragraphs. This concludes the proof. □ 

Before closing this section, let us say a few words on assisted distillation when 
the two recipients are separated by a one-dimensional chain of repeater nodes, as 
depicted in figure 15. 3[ Applying a hierarchical distillation strategy on ip^^*^^ will 
achieve a rate of ebits corresponding to 

R{ij) ■.= mm{I{A)Ci)^,,I{C2)D,)^,,I{D2)B)^,}. 

If we consider the cut 7i := {C} of the helpers C and D, the coherent information 
I{ATi)BTi)tp can be simplified to 



I{ATi)BTi)^, = I{C2)D,)^, - S{AC^)^, < I{C2)D,) 



■02 5 
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Figure 5.3: A 1-dimensional chain witli two repeater stations separating the two recipients 
A and B. 



and similarly for the cuts T2 = {CD} and Ts = 0, we have 

I{AT2)BT2)i, = I{D2)B).^, - S{ACi)^, - S{C2D^)^, < I{D2)B)^,, 
I{A%)Br3)^ = I{A)C,)^,. 

Thus, the minimum cut coherent information Imini'^^^'"^ ^ A : B) is not greater than 
R{ip), and so a hierarchical strategy might be better suited for the case of a chain 
state than a random measurement strategy, provided the information stored in the 
repeaters is not subject to errors (see the example of the previous section). It is easy 
to generalize the previous arguments to a chain of arbitrary length (i.e m > 3), and 
to other network configurations. 
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Conclusion 



6.1 Summary 

Following a brief review of quantum information theory in Chapter 2, we studied 
the problem of multiparty state merging. The main contribution of that chapter was 
to perform a direct technical analysis of the task of multiparty state merging. The 
main technical challenge to overcome was to adapt the decoupling lemma of [24] and 
the upper bound on the quantum decoupling error (Proposition 4 in [2l]) for the case 
of m senders and a single receiver sharing a multipartite state. Our upper bound is 
derived using a random measurement strategy for the senders. Our random coding 
strategy allowed us to use a well-known result from representation theory to simplify 
the calculations and thereby obtain a nice simple bound for the decoupling error. 
Using this calculation in the i.i.d setting, we proved the existence of protocols which 
achieve multiparty state merging without the need for time-sharing for the case of two 
senders and no side information at the decoder. We conjectured that time-sharing is 
not essential also for an arbitrary number of senders and discussed the main difficulty 
for proving this. 

We also introduced the split-transfer problem, a variation on the state merging 
task, and applied it in the context of assisted distillation. The main technical difficulty 
here was to formally prove that the decoding operations, implemented by two receivers 
A and B, can be done at the same time following simultaneous measurements by the 
senders. The essential ingredients for showing this were the commutativity of the 
Kraus operators Pj^ and and the triangle inequality. The rate region for a split- 
transfer is composed of two sub-regions, each corresponding to rates which would 
be achievable for a merging operation from T (resp. T) to A (resp. B). In the 
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context of assisted distillation, our split-transfer protocol was used to redistribute a 
pure multipartite state to two decoders A and B in such a way that it preserves the 
min-cut entanglement of the input state, provided the multiparty typicality conjecture 
holds. Under this assumption, we gave a non-recursive proof that the optimal assisted 
distillation rate is equal to the min-cut entanglement of the input state. 

In Chapter 4, an emphasis on how to accomplish merging when the participants 
have access only to a single copy of a quantum state was considered. The one-shot 
analysis was performed using the quantum min- and max-entropy formalism of [59] , 
and presented other difficulties than in the asymptotic setting. Most notably, because 
time-sharing is impossible with only a single copy of a quantum state, our intrinsically 
multiparty protocol provides the first method to interpolate between achievable costs 
in the multiparty setting. The technical challenge was to derive an upper bound 
on the decoupling error for a random coding strategy in terms of the min-entropies. 
We suspect that it might be possible to further improve our bound by replacing 
the min-entropies with their smooth variations, but it is unclear how to proceed in 
order to show this. We leave it as an open problem. To illustrate the advantages 
of intrinsic multiparty merging over iterated two-party merging, we have considered 
three different examples of one-shot distributed compression. Two of those examples 
demonstrate clearly the advantages of our protocol by allowing some of the senders to 
transfer their systems for free, something which is impossible for a protocol relying on 
two-party state merging. The last example considers a state which does not reduce 
to bipartite entanglement between various subsystems, and thus provided more of a 
challenge, but yielded greater rewards. Tractable computations for min- and max- 
entropies of non-trivial states are difficult to perform, and we feel this example is 
a useful contribution to the "one-shot" literature, which contains very few explicit 
examples. 

In the last chapter, we generalized the entanglement of assistance problem by 
allowing the parties to share a multipartite mixed state. For the case of three parties 
holding a mixed tripartite state, the optimal assisted rate was proven to be equal 
to the regularization of the one-shot entanglement of assistance, a quantity which 
maximizes the average distillable entanglement over all measurements performed by 
the helper. Two upper bounds for this quantity were established and examples of 
classes of states attaining them were given. Additionally, the one-shot entanglement 
of assistance was proven to be a convex quantity for pure ensembles. We also pre- 
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sented new protocols for assisted entanglement distillation, based on a random coding 
strategy, which are proven to distill entanglement at a rate no less than the minimum 
cut coherent information, defined as the minimum coherent information over all pos- 
sible bipartite cuts of the helpers. For states not saturating strong subadditivity, and 
recoverable by Alice and Bob if they can implement joint operations, we proved that 
our random coding strategy achieves rates surpassing the hashing inequality. More- 
over, the rates formally resemble those achievable if the helper system were merged to 
either Alice or Bob even when such merging is impossible. Finally, we compared our 
protocol to a hierarchical strategy in the context of quantum repeaters. We identified 
a major weakness of the hierarchical strategy by analyzing the effect of a CNOT error 
on the rates achievable for such strategy. We found that the rate, which can be as 
good as the rate of our random measurement, becomes null when such error occurs 
at the repeater node. On the other hand, our protocol is completely fault tolerant 
and yields the same rate even if this error goes undetected by the helper holding the 
systems at the repeater node. 

6.2 Future research directions 

Our proposed protocol in Chapter 5 for assisted distillation of an arbitrary mul- 
tipartite state involved a measurement on a long block of states, and then a measure- 
ment on blocks of these blocks, and so on. It seems likely that a strategy where all the 
helpers measure in a random basis of their respective typical subspaces and broadcast 
the results to Alice and Bob would still produce states preserving the minimum cut 
coherent information. For pure multipartite states, we showed in Chapter 3 that a 
split-transfer, followed by a distillation protocol between Alice and Bob, removes the 
multiple blocking required in [21] for distilling an optimal number of ebits between 
Alice and Bob. For an arbitrary multipartite state, we are still unsure if our split- 
transfer can be applied to remove the multiple blocking argument needed to show the 
lower bound. More generally, it would be interesting to come up with other potential 
applications for the split-transfer protocol. State merging was used as a building block 
for solving various communication tasks, and we believe split-transfer could be useful 
in other multipartite scenarios than the assisted distillation context. Alternatively, it 
could also simplify some of the existing protocols which rely on multiple applications 
of the state merging primitive. 

We could extend our assisted entanglement scenario in several ways. For instance. 
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we can consider other forms of pure entanglement such as GHZ states and look at 
the optimal achievable rates under LOCC operations. Another interesting question 
is to analyze whether general LOCC operations between the parties give more power 
to the helpers. In the pure multipartite case, we saw that such a strategy is not 
required to achieve optimal assisted rates. For multipartite mixed state, we should 
expect a difference in achievable rates when allowing more communication freedom to 
the helpers. We just have to consider bipartite distillation protocols to see this: the 
hashing protocol is impossible without communication between the parties. Finally 
another potential line of research is to analyze our assisted protocol using smooth 
min- and max-entropies. Recent work by Buscemi and Datta [85] analyzed the one- 
way distillable entanglement of a bipartite mixed state ip^^ in the one-shot regime 
using one-shot entropic quantities similar to the quantum min- and max-entropy of 
[59] . The entanglement of assistance for pure states ip'^^'^ was also analyzed under 
this framework [86], and it is another natural progression of our work to analyze the 
entanglement of assistance using the quantum min- and max-entropy formalism. 

Quantum min- and max-entropies are quantities which require an optimization 
over an infinite set of objects, and thus, do not lend easily to computation. In [61], an 
approach for computing these quantities was suggested in terms of semidefinite pro- 
gramming. In Chapter 4, we gave three examples where computation of min-entropies 
are tractable and have a relatively simple form. Finding other examples of classes of 
states for which their min- and max-entropies can be evaluated or characterized in 
more simple terms would be a welcomed addition to the literature on this subject. 

To conclude, theoretical investigations of multiparty protocols are difficult to per- 
form. In this thesis, we suggested new multiparty quantum communication protocols 
and performed a rigorous analysis of their properties when such analysis could be 
carried. The multiparty typicality conjecture is the most important open problem 
we leave on the table. We suspect that a proof technique for answering this conjec- 
ture in the affirmative will also be beneficial for several other multiuser information 
processing tasks [871 EHl ESj • As the design of these protocols become more complex, 
better techniques will be required for analyzing these protocols. 
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Various Technical Results 



A.l Trace norm 

Lemma A. 1.1. [24^ For an operator X acting on a space A, the Hilhert- Schmidt 
norm \\X\\2 is related to the trace norm as follows: 

\\X\\l<d\\X\\l (A.l) 

where d is the dimension of the support of X . 

Proof This follows straightforwardly from the convexity of the function by taking 

probabilities 1/d. □ 

A. 2 The swap operator 

Let A and A be any two isomorphic vector spaces of dimensions dA- Consider the 
computational bases and of the spaces A and A. The swap operator 

F^^ is defined by the following action on the basis elements {\i)^ ® \3)^} oi A® A: 

F^^\^%)^ := l<i,3<dA. 

For any two arbitrary vectors = Yl'i=i o^iK)"^ I'P)^ = Yl'jti "we have 

where \<jy)^ := E?=i AK)^ and := E-ti^^jlj)^- 
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The swap operator has many interesting properties. It is a unitary operator which 
is also hermitian. To see this, consider two basis elements and \kl)^^. We have 

{zj\F^F\kl) = {jt\lk) = S,,A,,, 

Prom the last line, it follows that the eigenvalues of F^^ are 1 and -1. Denote by 
the subspace spanned by the eigenvectors {le^^"^)^^} of F^^ with eigenvalues equal 
to 1. We call the symmetric subspace of A A. We sometimes use the symbol 

^sym to denote the projector onto this subspace. It will be clear from the context 
which definition applies. The subspace spanned by the eigenvectors {le""*')"^^} of 
F^^ with eigenvalues equal to -1 is written as 11^^^ and is called the anti-symmetric 
subspace of AA. This decomposes the space AA into two orthogonal subspaces. 

Lemma A. 2.1. Let A and A be any two isomorphic vector spaces of dimensions dA- 
Then, we have 



= span 



G A 



with = dA{dA + l)/2. 

Proof That the right hand side is contained in the symmetric subspace follows from 
the definition. It remains to show that any vector l?/))"^"^ e H^^"^ can be written as a 
linear combination of vectors of the form Write Itp)"^^ in the computational 

basis: 



Hence, we have aij = aji for all I < i,j < dA- We rewrite the state |?/;)^^ as 

l<i<j<(ix i=l 

dA 

l<i<j<dA i=l 

(A.3) 

This proves the first statement. (Observe that the previous set of vectors + 
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+ are also linearly independent.) To obtain the second 

result, notice that in the first line of the previous equation, the state is written as 
a linear combination of orthogonal vectors + \ji)^'^}i<i<j<dA 

Since the state was arbitrarily chosen, these vectors must generate the symmetric 
subspace. We have {^^) + dA = dA^dA + l)/2 of these vectors. □ 

Observe that the preceding equation implies this orthonormal basis for the anti- 
symmetric subspace: - |j^)^^)}i<i<i<dA- 

Corollary A. 2. 2. Let U he a unitary operator acting on a space A of dimension dA- 
For any two vectors in lij^ and € IlfJ-, we have {U ® U)\ip)^^ G 11^^^ 

and {U ® U)\(j))'^^ G ^anti- Here, the unitary U is a "copy" version of U : if U\i)^ = 
l^Pi)^, then U\tf = 

Proof From the preceding lemma, we write the vector as 

l<i<j<dA 
i 

Applying the unitary {U ® U) to this vector, followed by the swap operator F^^, we 
have 

F^^iu^um^^ = F(j2mih)''u\ih)^) = J2m^^)^u\^p,)^. 

i i 

Hence, the vector {U ® U)\ip)^'^ is in the symmetric subspace. To prove the second 
statement, we proceed similarly by writing the vector \(t))^^ G ^anti 

l<i<j<dA 



dA 



AA 



i=l 



140 



Appendix A 



Applying the swap operator to {U ® f/)|0) , we get 

^<i<j<dA 



l<i<j<dA 



AA 



Hence, the anti-symmetric subspace is also invariant under unitaries of the form 
(U^U). □ 

Lemma A. 2.3 (The swap trick). Let be any operator acting on a vector space 
A of dimension dA- Then, we have 



A\ T?AA-\ 



Tr[(X^)^] = Tr[(X^®X^)F 

where is a "copy" of X acting on an isomorphic space A of A. 
Proof Let's expand [X'^Y (we remove some of the notation for clarity): 

ij pq 

= Y\ij\pg6jji){q\. 

ijpq 

= Y\ip\pq\i){q\. 



(A.4) 



ipq 



Hence, Tr[(X^)2] = Eg=i KpKq- Now, expand the right hand side of eq. dXi]) : 



Tr[(X^®X^)F^^] = Tr 



Tr 



YXijXpq\ip){jq\F 

ijpq 

YXij\pq\pi){jq\ 



ijpq 



ijpq 
(Ia dA 



p=l q=l 

Ti[{X^f 



pq 
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□ 

Lemma A. 2. 4. Let A and R be any two arbitrary vector spaces of dimensions dA 
and dn respectively. Then, we have 



pAR,AR _ pAA ^ pRR 



(A.5) 



where AR := A R. Here, A and R are isomorphic vector spaces of A and R 
respectively. 

Proof The elements of the computational basis {\k)^^}'^^T^ can be rewritten as 
{|u)^^}i'=ijLi by identifying k with dA{j — !) + «. We have 

pARAR\-^^AR^^^^AR. ^ \mn)'^R\ij)^ 

= \m)^\n)^\i)^\3f 
= \m)^\i)^\n)^\])^' 

Since this holds for any 1 < i,m < dA and 1 < j,n < d^, we are done. □ 

A. 3 Averages over the unitary group 

Lemma A. 3.1. Let {\ek)}'l=i be an orthonormal basis of a space A. For alli,j with 
1 < ^5 j < dA, we have 



U\ei){ej\^UUU = 6, 



U(A) 



d, 



(A.6) 



where the average is taken over the unitary group U(A) using the Haar measure. 
Proof Let P(|ej)(ej|) = /^^j,^^ t/|ej) (e^ |'^f/"''c/t/. Using the linearity of the trace, we 



have 



Tr 



P{\e^){e,\) 



Tr 



U(A) 



Tr 



V(A) 



f/|e.)(e,-|^f/t 
f/^t/|e.)(e,-|^ 



dU 
dU 
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where the last step follows since WU = and dU = 1. (The Haar measure on 
a topological compact group is also a probability measure). The operator P{\ei){ej\) 
is hermitian. To see this, choose any unitary operator V such that V^lcj)"^ = \ej)^ 
and V\ej)^ = |ej)^ and use the right invariance of the Haar measure on the unitary 
group U(y4) ^: 

P(|e,)(e,|) = P(\/|e.)(e,|0) = / UV\e,){e,\^V^UUU 

JV{A) 

= I U\ej){ei\^U^dU 

JV{A) 

= n|e.)(e,|)^ 

Finally, write P{\ei){ej\) in its spectral decomposition as Yl!k=i^k\l^k){l^k\^ and let 
TT : {1, 2, . . . , c?^} — 7- {1,2,..., d^} be any permutation of the set {1, 2, ... , d^}- Define 
the unitary such that Kr|/ifc)^ = |/i7r(fc))^ for all 1 < /c < dA- Using the left- 
invariance of the Haar measure, we have 

^M^^^{k)){^^^{k)\ = V^P{\ei){ej\)V^ 

k 

= P{\ei){ej\) = ^Afc|/ifc)(/ifc|. 

k 

Since the eigenvectors \fik) are orthogonal, we have 

A.-i(fc) = Tr[KP(|e.)(e,|)V;t|/i,)(;,,|] = Tr[P(|e.)(e,|)|/ifc)(/Xfc|] = A, 

for any permutation it. Hence, all the eigenvalues of P(|ej)(ej|) must be the same, 
and since Tr[P(|ej)(ej|)] = 6ij, we either have A^ = for all k when i j or Xk = 
for all k when i = j. □ 

Lemma A. 3. 2. Let ip"^^ be any mixed bipartite state of the system AR. We have, 

[ (t/ ® ® = ^ ® V^^, (A.7) 

Jv{A) dA 

where the average is taken over the unitary group U(A) using the Haar measure. 

Proof Given orthonormal bases {|ej)}f^]^ and {\fi)}i=i of the systems A and R, 
^Thc Haar measure is left and right invariant for compact topological groups. 
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write the state ip^^ as Xlijfei ^ijki\eifj){ekfi\^^- Using the hnearity of the integral, we 
have 



V{A) 



ijki Ma) 



ijkl 



U\ei){ek\^UUU 



U(A) 



jj7 



R 



where we have used Lemma fA.S.ll to get the fourth hne. 



□ 



Lemma A. 3. 3. Consider any arbitrary mixed state ip'-^'^'^ of the systems Cm o-nd R, 
where Cm = Ci (g) C2 ® . . . ® Cm- Let Qi he a projector of rank onto a subspace C} 
of Ci and Ui a unitary acting on Ci. Define the sub-normalized density operator 



where Um := f/i ® f/2 ® • • • ® Um- Then, we have 

.C\.,R 



U(Ci) ^U(C2) ^U(C™) 



(A.8) 



where the average is taken over the unitary groups U(Ci), U(C2), . . . , U(Cm) using the 
Haar measure. Here dUM = dUidU2 ■ ■ ■ dUm, with j^iQ.^dUi = 1 and the average is 



over unitaries of the form Ui®U2®---®U„ 
Proof Let 



U(C„) 



and write the integral J^/^^ Iv(C2) ' ' ' Aire ) as /jj/^^^^) dUM- Additionally, define 
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Qm '■= Qi®Q2® ■ ■ ■®Qm- Using Lemma lA.3.2t we can simplify each of the integrals, 
starting with the inner most one: 



Uj''MR{UM)dUM 



U(Ca/) 

Qm 



M 



U(Ca/) 



Q 



M 



Q 



M 



U(Ca/-i) 



Q 



M 



Q 



M 



TCr, 



V{Cm-i 



' d, 



O77 



d: 



Qm 

Qm-1) 



where in the last line we have used the fact that 

jCm. jCm 

Qm~i Qm ~; ■ , 



'm 



Continuing in this way for the other integrals, the left hand side of eq. ( lA.Sp is 
eventually equal to the state 



dn, dnn dc, 



R 



Cm 



where we recall that Lm '■= YliLi Li and dcm '■= YliLi dc,- D 

For the following lemmas and the main proposition, we denote by D{X) the 
average f^^^^{W ®W)X {U ®U)dU . We will sometimes drop the superscript notation 
for clarity. 

Lemma A. 3. 4. Let 1 < i < j < dA o,nd 1 < k < I < dA- If i ^ k or j ^ I, we have 



{U^ ® U^) ( ± ] ( {kl\ ± {lk\ ] {U ® U)dU = 



U(A) 

Proof If z 7^ A; and i ^ I, consider the unitary V which flips the sign of \i) and fixes 
all other basis vectors. Using the left invariance of the Haar measure, we quickly see 
that these averages must be zero. If z 7^ /c , but i = I, use the left invariance with the 
unitary W which flips the signs of the vectors \i) and \k). Since j > i = I > k, the 
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averages must once again be zero. The other cases are treated similarly. □ 
Lemma A. 3. 5. Let 1 < i < (Ia and 1 < k < I < d^. We have 

(f/t ® U^)\ii) {{kl\ ± {lk\] {U ® U)dU = 



U(A) \ / 

{U^ ® U^)(\kl) ± |/A;) ) {ii\{U ® U)dU = 

Proof li i ^ k, then apply the left invariance property of the Haar measure with a 
unitary which flips the sign of the vector \k) and fixes all other basis vectors. U i ^ I, 
use the left invariance property of the Haar measure with a unitary which fiips the 
sign of |/) and fixes all other basis vectors. □ 



The following proposition was first proven in [2lj. We give a different proof in 
this appendix which does not rely on Schur's lemma (see pg.37 in [90]). Our proof 
suggests that the following result may hold for other measures than the Haar measure. 
That is, any measure which is invariant under permutations, sign-flip operators (i.e 
V\i) = — and Hadamard operations will satisfy the following proposition: 



Proposition A. 3. 6. [2^ Let A and A be any two isomorphic vector spaces of di- 
mensions d^, and let X he any operator acting on the tensor space AA. Then, we 
have 



[ ([/t ^ u^)x{u ® u)du = ^!^f "l7^ n^i + nil, (a.9) 

where U is a "copy" version of the unitary U acting on the space A. 

Proof Consider the representation of X in the orthonormal basis with vectors 

{\ii)}i=i , + \ji))}i<i<j<dA and {^(lij) - \ji))}i<i<j<dA- To simplify the 

notation, we impose some ordering on the basis elements and write the vectors in 
as |e^^"^) and the vectors in H^^^ as je^""*). We have 



V V «J 
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Using linearity of the integral, we have 

ij ij 

+ 5^a,,D(|er)(eri) + E "^^'^d^r*) (ef* I) 



Using Lemmas IA.3.41 and IA.3.51 the previous equation simplifies to 



antiJ ' 



(A.IO) 



Write -D(n^J^Xn^'^) as f^ijWi^"^) Using the right invariance of the Haar 



sym sym 

measure, we have 



V®VD{IllixU%t) 



for all unitaries V acting on A, with V being a copy version of V acting on A. As in 
Lemmas IA.3.41 and IA.3.51 we can choose carefully our unitary , fiipping some of the 
vectors in the appropriate way, so that the previous equation implies 

liij = —fiij for all 1 < j < dA{dA + l)/2 such that i ^ j. 

Hence, the operator D(]lf^XIlf^) is diagonal. To show that it is also a multiple of 
the projector onto the symmetric subspace, consider the unitary V which permutes 
the basis elements ^(|12) + |21)) and + 1^^))- Then, the right invariance of 

the Haar measure implies that 



Tr 



\ sym symJ 



V|12) + |12)\ /(12| + (21| 



V2 



Tr 



\ sym sym) 



\kl) + \lk)\ f{kl\ + {lk\ 



72 J\ y/2 
for a\\ 1 <k <l < d^. We can proceed similarly and show that 



Tr 



^(niixH4i)|u)(u| 



Tr 



^(n4ixH4i)|jj)(jj| 
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for all 1 < J < d-A- The last thing we need to prove is that 



Tr 



D{KtXUf^l)\U){U\ 



Tr 



This is proven using the right invariance of the Haar measure with a unitary V, 
which transforms the vectors |1) and |2) to the vectors -^(ll) + |2)) and — 
1 2)), while fixing all other basis vectors. Combining the previous facts, the operator 
D(n±ixn±i) is a multiple of Uf^ ■ 



sym' 



\ sym sym) 



An 



AA 
sym' 



Using the linearity and the cyclic property of the trace, we have TT[D(Ilf^XIlf^^)] 
Tr[n4iX],and so 



A 



By a similar argumentation, we have 



Tr[n4iX] 
Tr[n4i] ' 



TjAA 



Tr[nil] 



antii 



and so we are done. 



□ 



Proposition A. 3. 7. Let A he a vector space of dimension d^ and consider a subspace 
Ai of dimension L. For the swap operator F^i^i , the previous proposition evaluates 
to 

I (f/t ® U^)F^^\U ® U)dU = ^/^^ + ^j^^t'll ^^^ 

Jv(A) dA[a^ - I) dA[d^ - I) 

Proof The swap operator F^^^^^ can be expressed as Hfy^^ — U^^^^ . We have 
K'i' e nji and ni^,t e nii. Hence, 

Tr[F^^^^nf/j = L(L + l)/2, 
Tr[F^^^^nii] = {L- L')/2. 
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Using the previous proposition, we have 



(f/t ® u^)F^^^^{u ® u)du = 4i^i±^ui± - ^^t^JLn^^ 



U(A) 



dA{dA + iy 



sym 



dAidA - 1)" 



anti 



L{L + 1) I^^ + Ffyi L{L-1) I^^-F^^ 



dA{dA - 1) 2 



dA(dA + 1) 2 
^ L{dA - L) LjLdA -l) pAA 

dA{d\ - 1) c^a(4 - 1) 

(A.12) 

□ 

A. 3.1 Convexity of Da for pure ensembles 

Lemma A. 3. 8. For a state tp^^'-^ = '^^Piipf^'", where tpf"^^ are pure states, let 
F = {Fx}^^-^ be a POVM of rank one operators on the system C . Then, we have 



AB\ 
X )i 



(A.13) 



X,l 



where i,^^ = ^Trc[(F,. ® 1^^)^/;^^% = Tr[F,^^] and = ^^^Trc[(F, 



Proof The state we get after applying F^ on system C is given by: 
1 



-Tr, 



qx 



c 



J A \ Q.X 



ABC} 



P'i^'^cjFxW] JAB 

a ■ 



(A.14) 



Since qx = Ttc[Fx4''"] and J^iPii^? = i^"^ ^ have a well-defined ensemble of pure 
states on the right hand side of Eq. ( 1A.14I) . Since the distillable entanglement is 
bounded from above by the entanglement of formation, we get 



^g,D(-Trc 



Fx® I 



AB 



ABC 



Qx 



J2p^^^^c[Fx^pf]D{^Pi 



AB\ 
X ) 



(A.15) 



□ 
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Proposition A. 3. 9 (Convexity of Da for Pure Ensembles). Let ip^^'^ be an arbitrary 
tripartite state. Then, for any convex decomposition {Pi,'ipf^'"} of ip^^'^ into pure 
states, 

< Y.p.DAi^f''^'). (A.16) 

i 

Proof For any z/ > 0, there exists a POVM E = {E^}-^^^ of rank one operators 
sucli that 

Y.lMi^^'') > - ^, (A.17) 

X 

where ^jJ^^ = -^Ttc[{Ex ® I^^)ip"^^^]. From the previous lemma, we have 
J2<1^D{^^^) < 5^p.Tr[E,.^f]D(^^4f ) 

i X 

< Y,P,DA{^f (A.18) 

i 

where ipi^x = Trc[-E xi)'^] ^'^c[{Ex ® is the state obtained after performing 

the POVM E on the state ipf^'^'. Since u was arbitrarily chosen, we get back the 
statement of the proof. □ 

A. 4 Typicality 

Lemma A. 4.1. For n copies of a state iP'^mBR^ j^^^ n^^^, H^j^, . . . , H^^;.^, 11^ be the 
projectors onto the 5— typical subspaces B, Ci, C2, ■ ■ ■ , Cm o^nd R respectively. Then, 
we have 

^bcmr := ® n^^ ® . . . ® n^;^ ® > + n^^ + . . . + n^;^ + - (m + i)Ibc,,r, 

(A.19) 

where II is a shorthand for 11^ ® /C'iC'2- -C''"^^ and similarly for Hq-^^, Hj^^, • • • , 
and n^. 

Proof The projection operators involved in the proof statement pairwise commute, 
and thus, are simultaneously diagonalizable. Let {|ej)} be a common eigenbasis for 
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these projectors. Then any eigenvector \ei) with ^bcmR^^^"^ ~ 1^*) satisfies 
(u^ + Uc^ + ... + + - (m + 1)Ibc,,r^ |e.) = |e,). 

If |ej) is any eigenvector with Il^^^^j^\ei) = 0, then it must be in the kernel of at least 
one of the projection operators 11^, H^i^ ^C2^ • • • ' which implies that 



+ + • • • + n^.;^ + - (m + 1)/bcm/?J = ^ile^), 
where Aj < 0. Using both of these observations, we have 

^bcmR= Y1 |e*)(ei|> |ei)(ei|+ Yl ^i|e.)(ei| 

nscj,^jjl<^»)=|e») n^^^^^|e,)=|ei> n^cMfll'='>=° 

= + n^-, + • • • + n^;^ + - (m + i)Ibc,,r 

(A.20) 

□ 

Lemma A. 4. 2. (Hoeffding's inequality) For i.i.d random variables Xi,X2, . . . ,Xn, 
with Xi G [ai,bi], we have 



Pi 



> t) < 2 exp 



-2t' 



Lemma A. 4. 3. Let ip^ = Yli=iPi^i)\^i){^i\- For n copies of this state, and any 
6 > 0, we have 

Tr[V'f"n;^''^] > 1 - 2rfexp(-2n52). 
where If^^ is the projector for the S— typical subspace for ^/'f". 

Proof This follows from the previous lemma by using the union bound on Tr[?/;^"'n^''^] 

Lemma A. 4. 4. Let p be a state on X and let both Ux and Uy be orthogonal 
projectors acting on X and Y , respectively. Let Q = {Ux ® ny)p(nx (8> Hy). Then 
< Uxpx'n^x- 
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Proof The statement is equivalent to demonstrating that for all {ip\Qx\'ip) < 
{ip\IlxPx'n.x\'^), which can be seen by direct calculation. Let Ily = I — Ily. 

{i/j\UxPxIlx\^) = Tt[{\^){^\^I){Ux^I)p{IIx^I)] 

= Tr ® (Hy + Wy)) {Ux ® I)p{Ilx ® /)] 

= {mx\^) + Tr ® n^) {Ux ® I)p{Ilx ® /)] 

> (mx^. 

The inequality follows from the fact that Tr[y4i?] > whenever A,B > 0. □ 

Proposition A. 4. 5. Let ip^^'^'^ he an arbitrary mixed state and consider n copies of 
it. For any e > and n large enough, there exists a state \l/*^i*^2 njfiicfi satisfies 

'Xr[(vl/'^i'^2)2] < (1 — g)-22-"('S'(CiC2)^-w) 
Tr[(\l/'^^)^] < (1 — e)~22-n{s{Ci)^-3<5i) 
Tr[(^^2)2j < (1 - e)-22-"(s(^2)^-3'5i) 
rank < 2"(^(^»)'^+^i), 

where 6i and v can be made arbitrarily small by taking sufficiently large values of n. 
Here, z/(e) is a function of e which vanishes as e — 0. 

Proof For any 5i > 0, with 1 < i < 2, define the states: 

af'^' := U'}'^' (g) m'^'ij^'' ® n^^'^^ 



rClC2 TtS2,'52 /^ClC2N®S2rr*2.'52 

'2 ^^CrC2^^^ > ^^C^C^ 



where 11^'^ is the projector onto the 52-typical subspace for the state (o'f^'"^' 
Using Lemma [A. 4. II and Hoeffding's inequality, we have 

Tr[ng;^^ ® ng/^V^^- J > 1 - 2(rfc, + dc^) exp(-2s,<5,2) 

> 1 — cexp(— 2si5^) 
Tr[nj.|an > 1 -2(4Ma)exp(-2s252^) 

> 1 -c"iexp(-2s252) 
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for some constant c > 0. The distance between o"^^*"^ and ijj'^'^ is bounded using the 
triangle inequahty and the Gentle Measurement Lemma: 

< exp{-S26i) + sallaf^^^ - (A.21) 

< 2\/c^exp(— 52^2) + S22i/cexp(— 

For a fixed value of si, choose S2 to be such that 

e := cexp(-2si5i2) = c'' exp(-2s252) (A.22) 
Taking the logarithm on both sides, this is equivalent to: 

S2 = ^(2si5i + si ln(c) - ln(c)) 
^2 = ^(^i(25? + ln(c))-ln(c)) 

Replacing S2 into eq. (]A.2ip and using eq. (lA.22p . we have 



^c^C2 _ ^®n||^ < 2Vc^exp(-S252) + S22v^exp(-si(5i2) 



1 



2ycexp(-si(5i) + -2 (si(2(5i + ln(c)) - ln(c))v/cexp(-si(5i; 



"2 



which vanishes for sufficiently large values of Si. Let ^''^^'^^ normalized state 

of cr^^'"^. Using the triangle inequality, we have 



I^CiC^ _ ^®n||^ < 4y/dexp{-si6l) + ^2 (si(25i2 + ln(c)) - ln(c))v^exp(-si52) 

^2 



which also vanishes as si — 00. To bound the quantity Tr[(\E''^^'-^2)^j^ -^g have, using 
typicality (see eq. (12.17P ). 

Since the state af'^^ is close in the trace distance to the state ip'^^'^, we can apply 
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the Fannes inequality, obtaining 
Substituting into eq. flA.23p . we have 



Tr[(^<^iC'2^2| < (1 _ g)-22-^2(5(C'iC'2)ai-3<52) 

< (^l — g-)-22-'52Si(5'(CiC2)v,-r;(€)log(dci'^C2)-3^) (A. 24) 

< (1 _ ^y22~n{S{CiC2)^-v) ^ 

where v will vanish as si increases. To obtain a bound on Tr[(\l/'^^)^], we have 

a^'^' < {af'^'f. (A.25) 

Since the partial trace preserves the previous ordering and Tr[yli?] > for two positive 
operators A and B, we have 



= Tr[(afi)2]^^ 

^ 2~n(S(Ci)^-3Si) 

where the third line follows from Lemma IA.4.41 Finally, we have 

Tr[(^^i)'] < (1 - ey^Ti[{{af'f')^] 

< (1 - g)-22-n{S{Ci)^-35i)_ 



(A.26) 



(A.27) 



We can apply a similar set of inequalities for the quantity Tr[(cr^^)^] and obtain 

Trii^^^r] < (1 - 6)-2Tr[((ap)«-)2] 

< (1 _ g)-22-n{S(C2)^-3<5i)_ ^ ■ 
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Since a^^ < ((Tf')®"^^ the rank of can be bounded as follows: 



rank < (rank a^^Y^ 



The second line follows from Lemma rA.4.4[ We have a similar calculation for rank '^'-'^ , 
and so we are done. □ 



A. 5 Smooth max entropy 

Lemma A. 5.1. Suppose the density operator p has eigenvalues r = (ri, . . . ,rd) with 
> ''"i+i- Then 



Proof By eq. fl2.23p (see Chapter 2), H^^^{p) is equal to the minimum of H^i^^ij)) 
over all sub-normalized density operators p such that P{p,p) < e. From eq. f l2.20p . 
we can bound the purified distance from below by the trace distance: 



Since the smooth max entropy H^^J^p) is the minimization of Hj^^xip) over all sub- 
normalized density operator p with P{p, p) < e, the previous bound implies 



where H^^^{p) the minimization of the max entropy H.^^y^{p) over all sub-normalized 
density operators p such that ||p — p||i < 2e. Let p be a sub-normalized density 
operator such that H^^^{p) = H^i^^ij)). Let r = (fi,...,fd) be the eigenvalues of 
p, ordered such that Vj > fj^i. We will identify r and r with their corresponding 
diagonal matrices. Then, we have (see the proof of the Fannes inequality in [35]) 




(A.29) 



P{prp)>i^\\p-p\\i- 



HIM > mm{H^Up):\\p-p\U<2e}=:H^^M 



(A.30) 



— '^lli < \\p — p\\i ^ 2e. 
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Figure A.l: The square root function evaluated at four different points. The derivative 
of ^/x is a non-increasing function for x > 0. Since we have sj^ > Sjo+i by 
hypothesis, tlie finite difference Ai cannot be greater than the finite difference 
A2. 

Thus, without loss of generality, we can assume that p and p are diagonal in the same 
basis. We therefore dispense with p and p, discussing only r and r from now on. 

By eq. fl2.24p . we have H^^^{r) = 2 log J2j \f^ji which is monotonically decreasing 
in each r^. This implies that a minimizing r must satisfy < r^. If not, redefining 
Tj = Tj decreases ||f — r||i and -f^max(^) at the same time. 

We will now argue that there is a minimizing r with the following property: there 
is a jo for which = fj for all j < Jq and rj = for all j > Jq. Let s = (si, . . . , s^) 
be any vector such that Hma^{s) = H^^^{p), with Sj > s^+i > and Sj < rj. 
That such a vector exists follows from the arguments in the previous paragraphs. 
Suppose that s does not have the prescribed form. That is, there is a jo such that 
< but Sjo+i > 0. Note that this implies that sj^ > since sj^ > sj^+i. Let 
^ = minjrjQ — Sjo, sj^+i} be the minimum between Sj^+i and the difference between the 
eigenvalues rj^ and Sj^. Define the vector s' such that s'. = + ^, s',^^^ = Sj^+i — ^ 
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and s'j = Sj for j ^ {jo, jo + !}• If ^ = Sjo+i, the trace norm ||r — is equal to 



io-i 
io-i 

If ^ = — Sjg, the trace norm ||r — is equal to 

io-i 

= Ik - 

Hence, our new vector s' preserves the trace norm ||r — The derivative of the 
function ^/x is given by 

fV^=^, (A.31) 
ax 2a/x 

which is well-defined for any x > and is also a non-increasing function of x. Since 
■Sjo+i ^ ■5jp, we have (see Figure lATTj) : 



and so i/max(s') < -f^max(s)- If -f^max(s') is Icss than i/max(s), wc havc a contradiction 
and s must have the prescribed form. Thus, assume from now on the max entropies 
are equal. If = -^jo+i' vector s' has almost the prescribed form. We have 

s'jo+i = 0, but it could be that s'j > for any j > jo + 1- Let s" be the vector such 
that = s'j^_^_2^^'jo+2 = s'j = s'j for all other values of j. Then, the trace 
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norm H^' — is equal to ||r — as can be seen from the following equations: 

d 

II //II \ I I I " I I " \ 

\\r - s 111 = \rj - Sj\ + |rj(,+i - s^q+iI + rjo+2 - -^io+sl 

jy{io+i,io+2} 

d 

jy{jo+i,io+2} 

d 

Yl ~ 4) + '"io+l + (^"70+2 - 4+2) 

jy{jo+i,io+2} 

The third line is obtained using s'j^j^2 < ^jo+'i < "'^jo+i- Thus, we can push back s'- , ^ 
until another zero value is encountered. That is, there exists a vector s such that 
Sj > Sj+i, with Sj = Tj for all j < jo, Sj^ = s'j^ and Sj = for all j > k , where 
jo < k < d. If Sjo+i > 0, we apply the previous argument until the prescribed form r 
is obtained. 

If Sjo+i > ^, we have s^^ = and we can repeat the previous argumentation 
with Jq = jo + 1. Since jo is at most d, we will eventually find a vector r of the 
prescribed form. The statement follows by evaluating the max entropy for this vector 
r: 

^max(p) > ^max(p) = ^max(r) 

f k-1 d 

> 2 log min < : k such that rj < 2e 



. i=l j=k+l 



□ 



A. 6 Assisted distillation 

Lemma A. 6.1 (Markov's Inequality). If X is a random variable with probability 
distribution p{x) and expectation E{X), then, for any positive number a, we have: 

P(|A-| > „) < 

a 

Lemma A. 6. 2. Suppose we have n copies of the pure state ip'^^^^^ with S{R)^ < 
S{AB)^ and S{B)^ < S{AR)^. Let be be the normalized state obtained 
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after projecting the space C" into its 6— typical suhspace C . If Charlie performs a 
(rank one) random measurement of his system C, we have, for any fixed > and 

/ P (ll^f - < ei n ll^r - < 6) rff/ > 1 - «, (A.32) 



'U(C) 

where a can he made arbitrarily small by taking sufficiently large values of n. Here, 



J is the random variable associated with the measurement outcome and is 



the pure state of the systems A^^B"" and i?" after Charlie's measurement. 

Proof The proof of this statement is obtained by combining Proposition 15.3.11 
with Markov's inequahty and Boole's inequahty (the union bound). For any > 
and .^2 > 0, consider a projective measurement of Charhe with rank one projectors 
U\i){i\U'' and let J be the measurement outcome. We want to bound the following 
probability from below: 

Pj := P(||z^f - < 6 n ll^f - < 6) > 1 - « (A.33) 

for any a > 0. Applying the union bound and Markov's inequality to such probability, 
we have 

Averaging over all unitaries, using the Haar measure, we get 



PjdU > 1 - 

V{C) SI « 



(A.35) 



The averages are not quite of the desired form to apply Proposition 15.3.11 directly. 
Define the state 

and let [v]/)'^^^^ be the normalized version of \Vl)'~^^^^. If Charlie were to perform his 
measurement on the state \E', the properties of typicality tell us that the trace norms 
11^^" — \E'j^||i and \\ipf'^ — ^^||i should be arbitrarily close. This is verified by using 
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the bounds between the trace distance and the purified distance, eq. f l2.20p . and the 
monotonicity of the purified distance under trace non-increasing quantum operations 
(see [SD] for a proof of this fact): 

lie - *f 111 < ||^f^"^"«" - vl/f ^^11, 

for any e > by choosing sufficiently large values of n. The last line follows from 
typicality and the triangle inequality. A similar statement holds for the trace norm 
W'lpf" — 111- Applying the triangle inequality twice on each average of eq. (]A.35p . 
we have 



/u(c) E.P.II^f - ^%dU ^(^) E.P.II^f - ^^lli^^ 

ivic) ^1 ^2 

(A.36) 



[ PjdU > 1 - /(e) - 



where /(e) is a function of various trace norms which vanish, by typicality, for suf- 
ficiently large values of n. Applying Proposition 15.3.11 on the averages of eq. ( 1A.36I) . 
we can make the right hand side bigger than 1 — a for any a > by choosing n 
sufficiently large. □ 
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